Vol. 57, No. 2 March, 1950 


Psychological Review 


EDITED BY 


CARROLL C. PRATT 
PRINCETON UNIVERSITY 





CONTENTS 


On the Relation Between Similarity and Transfer of Training in the 
Learning of Discriminative Motor Tasks: 
R. M. Gagné, Katherine E. Baker and Harriet Foster


A Stimulus-Response Analysis of the Hoarding Habit in the Rat: 
Melvin H. Marx


Toward a Statistical Theory of Learning: William K. Estes
Behavior, Perception and the Psychophysical Methods: C. H. Graham


Description of the Learned Response in Discrimination Behavior: 
Henry W. Nissen





PUBLISHED BI-MONTHLY BY THE 
AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
PRINCE AND LEMON STS., LANCASTER, PA. 
AND 1515 MASSACHUSETTS AVE., N. W., WASHINGTON 5, D. C. 
$5.50 volume $1.00 issue 
Entered as second-class matter July 13, 1897, at the post-office at Lancaster, Pa., under Act of Congress of 
March 3, 1879 







MEMBERSHIP RULES 


IN THE 


AMERICAN PSYCHOLOGICAL ASSOCIATION 





There are three classes of membership in the American Psychological 
Association: Associate, Fellow, and Life Member. 


A. The largest class of membership is Associate. In order to qualify 
as an Associate an applicant must meet one of three sets of requirements: 


1. He must have a doctor’s degree based in part upon a psychological 
dissertation and conferred by a graduate school of recognized standing; or 


2. He must have completed two years of graduate work in psychology 
at a recognized graduate school and be devoting full time to work or graduate 
study that is primarily psychological in character; or 


3. He must have completed one year of graduate study plus one year 
of professional work in psychology and be devoting full time to work or 
graduate study that is primarily psychological in character. 


Scientists, educators, or other distinguished persons who are known to 
and recommended by the Board of Directors may also be elected as Asso- 
ciates of the American Psychological Association. 


Applicants must have their applications complete by September 15. New 
Associates are elected in the fall and their membership will be dated as of 
the next year. Journals due members will begin with the January issues. 
Annual dues for Associates are now $12.50. 


B. Properly qualified Associate members may, upon nomination by one 
of the Divisions and election by the Council of Representatives, become 
Fellows of the American Psychological Association. Fellows must previ- 
ously have been Associates. They must have a doctor’s degree and at 
least five years of acceptable professional experience beyond that degree. 
They must be primarily engaged in the advancement of psychology as a 
science and a profession. Annual dues for Fellows are now $17.50. In the 
American Psychological Association, no one is made a Fellow except at his
own request.


C. Life Membership is open to members who have reached the age of 
65 and who have been members for twenty years. They are exempt from 
dues, and receive the American Psychologist and the Directory. 





Vol. 57, No. 2


March, 1950


THE PSYCHOLOGICAL REVIEW 





ON THE RELATION BETWEEN SIMILARITY AND 
TRANSFER OF TRAINING IN THE LEARNING 
OF DISCRIMINATIVE MOTOR TASKS 1


BY R. M. GAGNÉ,* KATHERINE E. BAKER
Connecticut College 
AND 
HARRIET FOSTER 
University of Michigan 


INTRODUCTION 


It has long been recognized that the 
amount of transfer of training which 
takes place between one learning task 
and another depends upon the similarity 
of the tasks. A generally accepted rule 
is that in order to assure a high degree 
of transfer from one learned activity to 
a second, the first task must be made 
to resemble the second closely. The 
experimental studies which have been 
made of this relationship, however, have 
not arrived at conclusions which are in 
unanimous agreement with this point. 
Although some studies have indeed 
shown that transfer increases in amount 
as the two tasks are made more similar, 
there are some which have shown that 
increasing the similarity between tasks 


1 The formulation presented in this article 
grew out of research under Contract N7onr- 
316, Task Order I, between the Special De- 
vices Center, Office of Naval Research, and 
Connecticut College. The paper is a condensed
version of Report No. 316-1-5 under
that contract. 

* Now at Perceptual and Motor Skills Re- 
search Laboratory, AFTRC Human Resources 
Research Center, Lackland Air Force Base, 
Texas. 


has the opposite effect of decreasing the 
amount of transfer. The existence of 
a conflict in these results seems to indi- 
cate that the factors in the situation 
which have been varied in similarity 
may not have been adequately iden- 
tified or controlled in the different 
studies. At any rate, the principles 
which have emerged from these experi- 
mental studies do not at present permit 
the consistent prediction of transfer 
effects from a knowledge of the nature 
of the two tasks which are to be learned. 

The present paper examines the con- 
cept of similarity as it applies to the 
transfer of training between motor skills 
requiring the acquisition of two or more 
discriminative habits. In contrast to 
the usual statement of the relation be- 
tween similarity and transfer, it is here 
proposed that transfer between such 
tasks may best be understood in terms 
of the similarity among the stimuli and 
among the responses within each of the 
tasks to be learned, rather than between 
the elements of the first and second 
tasks (5, 13). It is contended that the 
attempt to vary the similarity of stimuli 
or of responses from one task to another 







usually brings about variations in simi- 
larity among the elements within the 
task. If these changes in similarity 
within the task are understood in the 
manner to be described, they provide a 
basis for the prediction of transfer from 
one task to another. 

It will be shown that variation be- 
tween a first task and a second in the 
transfer situation can be brought about 
in a number of ways: (a) by making 
the elements within the second task 
more alike; (b) by making the ele- 
ments within the second task more un- 
like; (c) by displacing the elements 
within the second task in the same di- 
rection along a scale of similarity; and 
finally (d) by reversing the relation- 
ships between the stimuli and responses 
in the second task. Our analysis will 
attempt to show that the first three 
kinds of variation may be expected to 
result in positive transfer between the 
two learning tasks by an amount which 
is related to the degree of such varia- 
tion. Reversal, on the other hand, will 
be shown to be the condition of nega- 
tive transfer, the amount of which in 
turn depends upon the degree of similarity
of elements within each task.2

One concept to be used in account- 
ing for the relationship between simi- 
larity and transfer is the gradient of 
stimulus generalization (7, 9). We as- 
sume that when an individual acquires 
a response to a given stimulus situation, 
he tends to make the same response in 
other situations which are similar to the 
original one. For present purposes 


2 It should be noted that it is not intended
to describe a quantitative theory of transfer. 
The assumptions to be made in arriving at 
the principles which we shall describe are not 
novel ones, but have been used to varying 
extents by other authors (4, 6, 15, 12). We 
are interested only in discovering the implica- 
tions of some of these basic assumptions for 
the problem of the relation between simi- 
larity and transfer in discriminative motor 
learning. 


similarity is defined in terms of sub- 
jects’ judgments. It is assumed that 
as stimuli become decreasingly similar, 
the strength of generalized response 
tendencies between them will also de- 
crease. In human learning situations 
we are probably never dealing with a 
purely primary gradient (10), but with 
a gradient whose form is partly deter- 
mined by secondary or mediated gen- 
eralization built up by a common verbal 
(or other internal) response to two 
different stimuli (such as two different 
“reds”). As Gibson’s (5) results show, 
gradients of generalization undergo a 
progressive reduction in range as training
is continued, i.e., as the required
discriminations become more firmly 
established (8). 

Although the evidence for its useful- 
ness is scanty, we propose also to use 
the concept of the response generaliza- 
tion gradient. We assume that when an 
individual learns a specific response to 
a given stimulus, he also acquires tend- 
encies to make similar responses to the 
same stimulus, such tendencies decreas- 
ing in strength as the degree of simi- 
larity between the responses and the 
original learned response decreases. In 
the case of responses, a definition of 
similarity is sought in terms of the 
results of movement rather than in 
characteristics of the muscular activity 
itself. For purposes of the present an- 
alysis, responses are conceived to vary 
in the aspects of direction, distance, 
force and rate, and similarities between 
them are discussed with reference to 
these characteristics as they apply to 
the movement of a single body member. 
As in the case of stimulus generaliza- 
tion, it is assumed that the effect of 
continued training is progressively to 
narrow the range of the response gen- 
eralization gradient, making the re- 
sponses to different stimuli more and 
more distinctive. 







TRANSFER RELATED TO VARIATION IN
THE MOTOR LEARNING TASK


The Effects of Stimulus Variation 
on Transfer 


It should now be possible to apply 
these concepts to a situation which involves
the acquisition of discriminative
motor habits. In order to illustrate the 
factors which may be subjected to 
variation in a motor learning task in 
which differential responses are made 
to discriminated stimuli, a simple ex- 
ample will be used which seems cap- 
able of representing in its elements a 
variety of motor skills. Our standard 
task is one in which a subject is re- 
quired to learn to make a manual re- 
sponse to a key located on the right on 
the appearance of a red stimulus light, 
and to make a response with the same 
hand to a key located on the left on 
the appearance of a green light. In 
the discussion which follows, this con- 
stitutes the first task, on which the sub- 
ject receives a moderate amount of 
training. He then proceeds to learn a 
second task in which the stimuli are 
altered in the ways to be described. 
Figure 1 is intended to illustrate the 
stimulus generalization gradients sur- 
rounding each of these stimuli at the 
beginning of learning the second task. 
For our purposes stimuli are considered 
to vary along the color dimension.3

1. Increasing the similarity of the 
stimuli in the second task. This sort 
of variation may be illustrated by the 
following example: First task: red light 
—right key; green light—left key. 
Second task: orange light—right key; 
yellow-green light—left key. The re- 
sponse tendencies established by train- 


3 The units of color differences shown in the 
figure have been assumed to be equal for pur- 
poses of illustration only. Within the range 
shown it is known that judgments of simi- 
larity correspond to the spectral order, but 
the units of judged similarity have not been 
determined. 


ing on the initial task, and consequently 
assumed to be present at the beginning 
of learning the second task, are pic- 
tured in Fig. 1. The strengths of the 
measured responses to orange and yel- 
low-green stimuli are determined by 
the differences between “correct” and 
“incorrect” generalized response tend- 
encies in each case. There exist strong 
tendencies to make “correct” responses 
to the stimuli of the second task which 
are counteracted to a slight extent by 
tendencies to make “incorrect” re- 
sponses to these same stimuli. The 
result of training, as indicated by the
effective strengths of the responses, is 
that the learning of the second task 
begins with the desired responses al- 
ready partially established. The gen- 
eral effect of this type of change may 
be summarized as follows: When the 
stimuli of the second learning task are 
made more similar to each other than 
those of the first learning task, the re- 
sult is positive transfer decreasing in 
amount with increased similarity among 
the stimuli within the second task. 

As the stimuli of the second task are 
made increasingly similar with respect 
to those present in the initial task, it 
is evident from Fig. 1A that there will 
be not only decreasing “correct” re- 
sponse tendencies, but at the same time 
increasing strengths of “incorrect” re- 
sponse tendencies. A smaller difference 
between these conflicting tendencies 
would consequently result in smaller 
effective response strengths and a 
smaller amount of positive transfer. It 
is evident that this process of making 
the stimuli more alike in the second 
task may theoretically be carried to 
the point at which the stimuli are no 
longer discriminable, and the resulting 
transfer approaches zero. 
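
The reasoning of this case can be sketched numerically. The following Python fragment is an illustration only and is not part of the original analysis, which is explicitly non-quantitative. The exponential form of the gradient, its width, and the equally spaced color positions are all assumptions chosen for convenience.

```python
import math

# Hypothetical positions on an equal-interval color scale (illustration only).
SCALE = {"red": 1.0, "orange": 2.0, "yellow": 3.0, "yellow-green": 4.0, "green": 5.0}


def gradient(distance, width=1.0):
    """Assumed exponential generalization gradient; 1.0 at the trained stimulus."""
    return math.exp(-abs(distance) / width)


def effective_strength(stimulus, correct_anchor, incorrect_anchor):
    """Correct minus incorrect generalized tendency for one second-task stimulus."""
    correct = gradient(SCALE[stimulus] - SCALE[correct_anchor])
    incorrect = gradient(SCALE[stimulus] - SCALE[incorrect_anchor])
    return correct - incorrect


# First task: red -> right key, green -> left key.
# Second-task pairs of increasing similarity (right-key stimulus, left-key stimulus).
for right_stim, left_stim in [("orange", "yellow-green"), ("yellow", "yellow")]:
    right = effective_strength(right_stim, "red", "green")
    left = effective_strength(left_stim, "green", "red")
    print(f"{right_stim} / {left_stim}: right={right:.3f} left={left:.3f}")
```

Under these assumptions the effective strengths are positive for the orange and yellow-green pair and fall to zero when both second-task stimuli coincide at yellow, the limiting case in which the stimuli are no longer discriminable.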

2. Decreasing the similarity of the 
stimuli in the second task. This sort
of variation may be illustrated by the 
following example: First task: red light 





[Figure 1, panels A to D. Abscissa: stimulus similarity. Ordinate: strength of
response tendency. Legible curve labels: right response, left response,
effective strength of correct response.]

Fig. 1. Expected strengths of response tendencies, and resultant strengths of
correct and incorrect responses occurring in a shift from an initial
discrimination task to a second task in which the stimuli are altered. In each
case it is assumed that the subject has had a moderate amount of training on
the task RED—Right Response, GREEN—Left Response. A: Expected tendencies at
the start of the learning of a second task in which the associations to be
acquired are ORANGE—Right Response; YELLOW-GREEN—Left Response. B: Expected
tendencies at the start of the learning of a second task in which the
associations to be acquired are DEEP RED—Right Response, BLUE-GREEN—Left
Response. C: Expected tendencies at the start of the learning of a second task
in which the associations to be acquired are ORANGE—Right Response;
BLUE-GREEN—Left Response. D: Expected tendencies at the start of the learning
of the second task in which the (reversed) associations to be acquired are
GREEN—Right Response; RED—Left Response.


—right key; green light—left key. 
Second task: deep red light—right key; 
blue-green light—left key. In terms of 
Fig. 1B, this kind of change means 
that the subject begins to learn the 
second task with strong response tend- 
encies to the “correct” keys, which are 
counteracted by relatively weak “in- 
correct” response tendencies. The effec- 
tive strengths of the right-key response 
to deep red and the left-key response 
to blue-green are again obtained by sub- 
tracting the conflicting tendencies in 
each case. This situation in which the 
stimuli are made less similar in the 
second task in contrast to the previ- 
ously described situation in which they 
are made more similar, results in con- 
siderably weaker “incorrect” response 
tendencies. Making the stimuli less 
alike in the second task may therefore 
be expected to produce even greater 
positive transfer than making the 
stimuli more alike, assuming an equiva- 
lent degree of change in stimulus 
similarity in each instance. Figure 1B 
indicates that the transfer decreases 
more gradually than is the case with 
transfer resulting from increasing stimu- 
lus similarity (as shown in Fig. 1A). 
Consequently, with decreasing similar- 
ity of the stimuli in the second task, 
the transfer comes to depend almost 
entirely upon the strengths of the gen- 
eralized “correct” response tendencies. 
The following statement summarizes 
this situation: When the stimuli of the 
second learning task are made less simi- 
lar to each other than those of the first 
learning task, the result is positive 
transfer decreasing in amount as the 
degree of similarity among the stimuli 
of the second task is decreased. 
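
The comparison between equal changes toward and away from the other stimulus can be illustrated with a small sketch. It is not the authors' computation. The exponential gradient, the unit width, and the positions assigned to red and green are hypothetical values used only to make the argument concrete.

```python
import math

RED, GREEN = 1.0, 5.0  # assumed positions of the first-task stimuli on the color scale


def gradient(distance, width=1.0):
    """Assumed exponential generalization gradient; 1.0 at the trained stimulus."""
    return math.exp(-abs(distance) / width)


def effective(shift):
    """Effective right-key strength when the red stimulus is moved by `shift` units.

    A positive shift moves it toward green (more similar); a negative shift
    moves it away from green (less similar).
    """
    position = RED + shift
    return gradient(position - RED) - gradient(position - GREEN)


shifts = (0.5, 1.0, 1.5)
more_similar = {d: effective(+d) for d in shifts}
less_similar = {d: effective(-d) for d in shifts}

for d in shifts:
    print(f"shift {d}: more similar={more_similar[d]:.3f} less similar={less_similar[d]:.3f}")
```

For every shift tried, the less similar condition retains the larger effective strength, and both conditions lose strength as the shift grows. This agrees with the statement that transfer remains positive but decreases in either direction.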

3. Altering each of the stimuli in the 
same direction along a scale of similar- 
ity. This kind of variation is illustrated 
by the following example: First task: 
red light—right key; green light—left 
key. Second task: orange light—right 


key; blue-green light—left key. As 
illustrated in Fig. 1C, this situation has 
some interesting characteristics which 
distinguish it from those previously de- 
scribed. When the color of one light 
is changed from red to orange, this 
orange stimulus not only differs from 
red, but is at the same time in some 
degree more similar to green, the stimu- 
lus originally associated with a different 
response. In contrast, when the color 
of the green light is changed to blue- 
green, the blue-green color is similar to 
green but does not approach red in 
similarity. Thus the learning of the 
response to the blue-green light in the 
second task would be facilitated to a 
greater extent than the learning of the 
response to the orange light, because of 
the existence in the former case of a 
relatively weaker “incorrect” response 
tendency. Presumably this difference 
would be revealed in the learning of 
the second task by the occurrence of a 
greater number of errors to the orange 
light than to the blue-green light.4
This situation may be summarized in 
the following statement: In proceeding
to a second task from the learning of a 
first task, transfer decreases with in- 
creasing degrees of displacement of the 
stimuli in the same direction along a 
scale of similarity. 
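
The asymmetry between the two displaced stimuli can be shown in a short sketch. The numbers are illustrative assumptions (an exponential gradient of unit width and equally spaced color positions), not values taken from the paper.

```python
import math

# Hypothetical equal-interval positions; only the ordering is taken from the text.
SCALE = {"red": 1.0, "orange": 2.0, "green": 5.0, "blue-green": 6.0}


def gradient(distance, width=1.0):
    """Assumed exponential generalization gradient; 1.0 at the trained stimulus."""
    return math.exp(-abs(distance) / width)


def tendencies(stimulus, correct_anchor, incorrect_anchor):
    """Return (correct, incorrect) generalized tendencies for a displaced stimulus."""
    correct = gradient(SCALE[stimulus] - SCALE[correct_anchor])
    incorrect = gradient(SCALE[stimulus] - SCALE[incorrect_anchor])
    return correct, incorrect


# Both stimuli are displaced one unit in the same direction along the scale.
orange = tendencies("orange", "red", "green")          # moves toward green
blue_green = tendencies("blue-green", "green", "red")  # moves away from red

print("orange     correct=%.3f incorrect=%.3f" % orange)
print("blue-green correct=%.3f incorrect=%.3f" % blue_green)
```

The two stimuli receive the same "correct" tendency, since each lies one unit from its trained color, but the "incorrect" tendency is weaker for blue-green. That is the basis for expecting more errors to the orange light.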

4. Stimulus alteration involving medi- 
ated similarity. The stimuli employed 

4 For the sake of brevity we limit our consideration
of displacement to cases which involve
relatively small changes in the stimulus.
As the amount of displacement increases, the
stimulus to be associated with the right key 
response in the second task becomes increas- 
ingly similar to the stimulus previously asso- 
ciated with the left key, a situation in which 
the “incorrect” response tendency to this 
stimulus outweighs the “correct” response 
tendency. Transfer predictions for this situa- 
tion depend upon considerations to be de- 
scribed later in connection with reversal. Dis- 
placement includes the type of alteration 
which occurs in experiments on transposition 
behavior, and our analysis of it bears some 
resemblance to Spence’s (15). 







in a motor task may be replaced by 
other stimuli which are related, not be- 
cause they fall along some physical 
scale, but because previous training has 
associated them with identical re- 
sponses. The outstanding example is 
the case of verbal stimuli consisting of 
the appropriate “names” for the actual 
stimuli which make up the motor learn- 
ing situation. The importance of this 
kind of stimulation for human subjects 
is shown by their frequent verbaliza- 
tion in learning motor tasks. 

The extent to which transfer will 
occur between a motor task with stimuli 
such as lights and a task in which only 
the “names” for these stimuli are pre- 
sented would seem to depend upon the 
extent to which previous training has 
established habitual connections be- 
tween a light stimulus and the verbal 
response of its “name.” Presumably 
the individual will tend to react in the 
same way to the verbal stimulus “red” 
as to the stimulus provided by a red 
light, because in the past both of these 
have been habitually associated with a 
verbal response which in turn provides 
a conditioned stimulus. It seems evi- 
dent that stronger associations are 
established between some kinds of 
physical stimuli and their naming re- 
sponses than others, thus producing 
differences in transfer effects. 

The existence of these mediating 
stimuli also has a significant effect on 
transfer between two motor tasks. 
When two stimuli are equivalent be- 
cause of their association with a com- 
mon verbal response, transfer is ex- 
pected to occur; but its extent cannot 
be accurately predicted without a previ- 
ous measurement of the degree to which 
such mediating effects are present in 
any given situation. We have proposed 
that this measurement may be made 
by determining the similarity between 
stimuli by means of subjects’ judgments 
which are assumed to reflect the degree 


to which the stimuli have been prac- 
ticed with a common verbal response. 
This proposal implies that the shape of 
the gradient of generalization must be 
measured in terms of similarity judg- 
ments before predictions of transfer can 
be made with any degree of accuracy. 

5. Varying the number of discrimi- 
nable aspects of stimulation. A motor 
task may be altered by reducing or in- 
creasing the number of discriminable 
aspects of the stimuli which are to be 
associated with different responses. If, 
in our now familiar example, the red 
light were on the right side of the stimu- 
lus panel and the green on the left, a 
reduction in the number of discrimi- 
nable aspects of the stimuli would be 
effected if the second task involved 
simply white lights in right and left 
positions. On the other hand, increas- 
ing the number of discriminable aspects 
of stimulation may be illustrated by 
means of the following second task: 
Square red light on right—press key on 
right; circular green light on left—
press key on left. 

It is supposed that each individual 
aspect of stimulation which has en- 
tered into training has generated its 
own gradient of generalization similar 
to those which have been described in 
the previous section. It seems reason- 
able to assume that the degree of simi- 
larity between stimuli decreases with 
the number of stimulus aspects which 
form the bases for the discrimination. 
Thus the task which requires different 
responses to stimuli differing only in 
color is assumed to be a more difficult 
task than one which requires the same 
responses to stimuli which differ from 
each other in shape and size as well. 
In such a case a smaller degree of overlapping
between the generalization
gradients would be expected, and 
weaker “incorrect” response tendencies 
are assumed to exist for each of the 
stimuli concerned. 







With this arrangement we are able 
to speak of making the stimuli of a 
task more similar or less similar to 
each other by reducing and increasing,
respectively, the number of discriminative
aspects of stimulation provided
by each. The implications to be
drawn from this sort of analysis cor- 
respond with those outlined in a previ- 
ous section. We should again expect 
the increase in similarity, in the sense 
of reducing the number of discrimi- 
native aspects of stimulation, to result 
in positive transfer which decreases as 
the stimuli are made more alike. We 
should also expect a high degree of 
positive transfer to result from making 
the stimuli of the second task more 
different by increasing the number of 
differential cues provided by each, be- 
cause of the relatively weak interfer- 
ing response tendencies established by 
training on the initial task. 


The Effects of Response Variation on 
Transfer 


The assumption of gradients of re- 
sponse generalization leads to conclu- 
sions concerning the relation between 
response similarity and transfer which 
are in many respects the same as those 
which follow from our analysis of stimu- 
lus similarity. It is supposed, for ex- 
ample, that when a subject acquires 
differential responses of 4 ounces and 
12 ounces to two different stimuli, he 
also acquires tendencies to react with 
similar but unidentical degrees of force 
to the same stimuli. The effect on 
transfer of response variation from a 
first task to a second is also conceived 
to be analogous to the effect of stimu- 
lus variation. (a) A second task may 
be altered from a first task by increas- 
ing the similarity between the differ- 
ential responses to be acquired. The 
resulting transfer would be expected to 
be positive and to decrease in amount 
as the responses were made more alike. 


(b) The responses of the second task 
may be made less similar to each other 
than those of the first. Again positive 
transfer, decreasing in amount as the 
responses are made less alike, is the 
result to be expected. (c) The re- 
sponses of the second task may be dis- 
placed in the same direction on a scale 
of similarity. Such a change would also 
be expected to yield transfer in accord- 
ance with the propositions of the previ- 
ous section, i.e., transfer would decrease 
in amount with increasing degrees of 
displacement. 

The responses within the second task 
may be made more alike or less alike 
than those within the first by changing 
the number of differential aspects which 
they possess. For example, if the first 
task required responses differing from 
each other in both direction and in 
force, the second task might require 
responses which differ in force only, 
thus making the responses more similar 
to each other. In accordance with our 
previous analysis we should expect posi- 
tive transfer to decrease to the extent 
that these responses have been made 
more similar to each other in the second 
task. It will be recognized that an 
example could readily be given to illus- 
trate the case in which responses are 
made less similar to each other in the 
second task by adding some character- 
istic which was not present in the re- 
sponses of the first task. Again trans- 
fer would be predicted in accordance 
with the analogous instance of stimulus 
variation. 

Perhaps the most important thing to 
point out regarding the transfer effects
of variation in responses from one task
to another is that we are not led to
conclude that different results are ob- 
tained by varying stimuli alone and 
by varying responses alone. The Wylie 
hypothesis (16) predicts positive trans- 
fer when responses remain constant and 
stimuli are varied and negative transfer 







when stimuli remain constant and re- 
sponses are varied. In contrast to 
this, the present hypothesis states that 
when stimuli are maintained constant, 
variations in the similarity of the re- 
sponses within a task produce positive 
transfer just as do variations in simi- 
larity of stimuli when responses are 
held constant. The conditions which 
lead to negative transfer, on the other 
hand, are conceived to involve a type 
of variation called reversal which can- 
not be described fully in terms of simi- 
larity between responses. The condi- 
tions of reversal are described in a 
following section. 

One type of response variation which 
is not extensively considered here in- 
volves requiring the subject to make 
responses in the second task with an 
entirely different body member. In 
such instances, provided the stimuli of 
the two tasks remain the same, positive 


transfer may be expected. Studies have 
shown, for example, considerable de- 
grees of positive transfer of motor tasks 
between such members as the right and 
left hands (3), or the hand and foot 


(1, 3). Presumably such transfer 
occurs because these responses are 
highly similar even though they do in- 
volve different members. Whether this 
similarity exists because of previous 
learning or because of some kind of 
innate neural organization, we make no 
attempt to answer here. Another rea- 
son why we are led to expect positive 
transfer between body members is 
simply because of the fact that the 
stimuli of the two tasks remain iden- 
tical. The effect of practice on the 
first task is consequently to make the 
stimuli become more and more distinc- 
tive, and this effect carries over to the 
second task even when the responses 
required in that task make use of en- 
tirely different parts of the body. Our 
reasoning in this instance is essentially 
the same as that of Gibson (4), who 


discusses this phenomenon as stimulus 
pre-differentiation. 


Transfer Effects of Reversing Stimulus- 
Response Relationships 


It is now necessary to introduce a 
type of task variation which is different 
in a fundamental sense from those 
previously considered. Up to the pres- 
ent we have dealt with situations in 
which certain kinds of changes are 
brought about in stimuli associated 
with given responses, or with changes 
in responses associated with given 
stimuli. The type of variation which 
we wish to describe here, however, arises 
when we ask the subject in the second 
task to learn to associate a given stimu- 
lus with a response which has previously 
been associated with another stimulus 
of the same task. In terms of our 
standard example the situation would 
be represented as follows: First task: 
red light—right key; green light—left 
key. Second task: red light—left key; 
green light—right key. This transfer 
situation involves complete reversal of 
the stimulus-response relationships in 
proceeding from the first task to the 
second. Figure 1D represents the ex- 
pected results at the beginning of the 
learning of the second task. It will be 
seen that the learner begins this task 
with a tendency (“correct” for this 
task) to make a left-key response of 
nearly zero strength to the red light, 
and with the tendency to make the 
right-key response of considerable 
strength (now “incorrect”) to the red 
light. Likewise the tendency to make 
the right key response (“correct” for 
this task) to the green stimulus is of 
nearly zero strength, whereas the tend- 
ency to make a left-key response (now 
“incorrect”) is great. This means that 
the subject begins to learn the second 
task with the considerable handicap of 
strong interfering (“incorrect”)  re- 
sponse tendencies. It is easy to under- 







stand that under such conditions he 
will not learn the task as rapidly as he 
would if he had not had any previous 
training on the first task, i.e., the resulting
transfer is negative. Although situations
of this sort have not been systematically
investigated in studies of
motor learning, somewhat comparable 
conditions occur in the studies of Hun- 
ter (11) and Bunch (2), in which ani- 
mals first learned one maze pattern and 
were subsequently required to learn a 
pattern whose correct turns were ex- 
actly reversed. In both these studies 
large amounts of negative transfer were 
found. Our conclusion is that reversal 
of the stimulus-response relationships
within a discrimination learning task 
is the typical condition for negative 
transfer. The results of complete re- 
versal may be summarized as follows: 
When the stimuli and responses of the 
second learning task are identical with 
those of the first learning task, but the 
relationships between these stimuli and 
responses are reversed, the result is a 
maximum amount of negative transfer. 
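
The case of complete reversal can be illustrated with the same kind of sketch. The gradient shape, its width, and the color positions are assumptions made for illustration; the paper itself gives no numerical values.

```python
import math

RED, GREEN = 1.0, 5.0  # assumed positions on an equal-interval color scale


def gradient(distance, width=1.0):
    """Assumed exponential generalization gradient; 1.0 at the trained stimulus."""
    return math.exp(-abs(distance) / width)


def effective_strength(stimulus, trained_with_correct, trained_with_incorrect):
    """Correct minus incorrect tendency at the start of the second task.

    `trained_with_correct` is the first-task stimulus that was paired with the
    response now required; `trained_with_incorrect` is the stimulus that was
    paired with the competing response.
    """
    correct = gradient(stimulus - trained_with_correct)
    incorrect = gradient(stimulus - trained_with_incorrect)
    return correct - incorrect


# Second task: red -> left key, green -> right key.
# In the first task the left key was trained to green and the right key to red.
red_light = effective_strength(RED, trained_with_correct=GREEN, trained_with_incorrect=RED)
green_light = effective_strength(GREEN, trained_with_correct=RED, trained_with_incorrect=GREEN)

print(f"red light: {red_light:.3f}, green light: {green_light:.3f}")
```

Both effective strengths come out negative: the now "incorrect" tendency is at full strength while the "correct" one is close to zero, which corresponds to the handicap described for the reversed task.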

One important characteristic of the 
situation which involves complete re- 
versal is that it makes little difference 
whether the responses have been re- 
versed or the stimuli reversed. One can 
say either that the subject is required, 
in the second task, to learn to make the 
same responses to reversed stimuli, or 
that the subject is required to learn to 
make reversed responses to the same 
stimuli. 

Because of the general acceptance of 
equivalence between the terms, it is im- 
portant to emphasize the distinction 
between interference and negative trans- 
fer which this discussion implies. We 
conceive of interference as the likely 
result whenever incorrect response 
tendencies are present. Thus in the 
previously described situation of mak- 
ing the stimuli more similar to each 
other in a second task, the degree of
interference which occurs was seen to
depend upon the degree to which the 
stimuli were made more alike, and 
would be measured by an increase in 
errors or by a decrease in response 
time. In that situation, however, the 
over-all effect, despite the interference, 
was seen to be positive transfer. It is 
only when the incorrect response tend- 
encies become greater in amount than 
the correct response tendencies that we 
expect to get negative transfer. In 
such a case the learning of a second 
task is interfered with to so great an 
extent that the rate of learning is 
slower than the learning of the same 
task would be without previous train- 
ing on an initial task. Thus, the ex- 
perimental meaning of negative transfer 
is a reduction in rate of learning, fol- 
lowing practice on a previous task, be- 
low the rate which would have been 
achieved without any previous practice. 
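
The distinction drawn above between interference and negative transfer can be put as a small sketch. The Python below is illustrative only: the numerical strengths, the function name `classify_transfer`, and the simple comparison rule are assumptions introduced here, not quantities or procedures given in the paper. Interference is flagged whenever an incorrect response tendency is present; negative transfer is predicted only when the incorrect tendency exceeds the correct one.

```python
def classify_transfer(correct: float, incorrect: float) -> dict:
    """Hypothetical reading of the interference / negative-transfer distinction.

    `correct` and `incorrect` are assumed, unit-free strengths of the correct
    and incorrect response tendencies at the start of the second task.
    """
    if correct < 0 or incorrect < 0:
        raise ValueError("tendency strengths must be non-negative")

    # Any incorrect tendency at all is treated as a source of interference.
    interference = incorrect > 0

    if incorrect > correct:
        transfer = "negative"   # incorrect tendencies outweigh correct ones
    elif incorrect < correct:
        transfer = "positive"   # over-all effect is positive despite interference
    else:
        transfer = "zero"       # equal tendencies cancel each other out

    return {"interference": interference, "transfer": transfer}
```

Under these assumptions, `classify_transfer(0.8, 0.2)` reports interference together with positive transfer, which is the case of stimuli made more alike, while `classify_transfer(0.1, 0.9)` reports negative transfer, which is the case of reversal.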

The transfer situation may be set up 
in such a way that the two tasks vary 
in the degree to which they approach 
this condition of complete reversal. 
Such situations are referred to as partial 
reversal. Let us suppose again that the 
original task is one which presents the 
following discrimination: red light— 
right key; green light—left key. Sup- 
pose, then, that we altered this to a 
final task like this: orange light—left 
key; yellow-green light—right key. It 
is evident that in such a situation the 
subject is required to learn to make 
reversed responses to stimuli which are 
highly similar to the original ones, 
though not identical with them. It
will be realized from an examination of 
Fig. 1D that at the beginning of the 
learning of the second task the “incor- 
rect” response tendencies to the orange 
light and the yellow-green light would 
in this case far outweigh the “correct” 
response tendencies to the same stimuli. 
It is expected, therefore, that negative 
transfer would be the result, i.e., that
the training on the first task would
hinder rather than aid the learning of
the second. It will also be evident that
as the stimuli to be associated with
reversed responses become more alike,
the smaller will be the “incorrect” tendencies,
and consequently, the smaller
the amount of negative transfer. Zero
transfer is approached as the response
tendencies to correct and incorrect keys
become more equal and tend to cancel
each other out. This conclusion may
be summarized as follows: When the
stimuli of the second learning task are
made more similar to each other than
those of the first learning task, and the
responses are reversed, the result is
negative transfer decreasing in amount
as the similarity between stimuli is
increased.


TABLE I

SHOWING THE TYPES OF VARIATION WHICH MAY BE MADE IN A SECOND LEARNING TASK WITH
RESPECT TO A FIRST LEARNING TASK, AND THE TRANSFER EXPECTED TO RESULT FROM EACH

Type of variation     Stimuli of                  Responses of                S-R relationships    Transfer
in second task        second task                 second task                 of second task

Stimulus alteration   increasingly similar        identical                   identical            positive, decreasing in amount
                      increasingly dissimilar     identical                   identical            positive, decreasing in amount
                      same degree of similarity,  identical                   identical            positive, decreasing in amount
                        increasingly displaced⁵

Response alteration   identical                   increasingly similar        identical            positive, decreasing in amount
                      identical                   increasingly dissimilar     identical            positive, decreasing in amount
                      identical                   same degree of similarity,  identical            positive, decreasing in amount
                                                    increasingly displaced⁵

Complete reversal     identical                   identical                   reversed             maximum negative

Partial reversal      increasingly similar,       identical                   reversed             negative, decreasing in amount
                        increasingly dissimilar,
                        or increasingly displaced
                      identical                   increasingly similar,       reversed             negative, decreasing in amount
                                                    increasingly dissimilar,
                                                    or increasingly displaced⁵

⁵ Transfer predicted from this type of variation is conceived to hold only for relatively
small amounts of stimulus change. See footnote 4.

It is hoped that our discussion up to 
this point has made clear the prediction 
that the reversal of stimulus-response 
relationships leads to negative transfer. 
Moreover such reversal is to be con- 
trasted in its effect upon transfer with 
the kind of change in the internal simi- 
larity of a task which maintains the 
same relationships between the stimuli 
and responses. Table I provides a sum- 
mary of the conclusions which have 
been arrived at in accordance with this 
hypothesis, and serves at the same time 
to emphasize the differences between 
the conditions of positive transfer and 
negative transfer. The table contains 
descriptions of the different ways in 
which the elements of a second task 
may be altered with respect to those 
of the first. The transfer which is 
predicted to follow each specific kind 
of change, in accordance with our hy- 
pothesis, is shown in the final column. 
As a general summary it may be said 
that when S-R relationships remain 
identical, the greater the amount of al- 
teration of the elements of the task 
the smaller will be the positive trans- 
fer; when S-R relationships are re- 
versed, the greater the amount of al- 
teration of the elements of a task, the 
smaller will be the amount of negative 
transfer. 
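
The general summary just stated can be sketched as a small lookup. The Python below is a minimal illustration under stated assumptions: the names `SRRelation` and `predicted_transfer` are hypothetical, and the case of an unaltered task with identical S-R relationships is reported as not tabulated because Table I does not list it. The returned strings follow the wording of the final column of Table I.

```python
from enum import Enum


class SRRelation(Enum):
    """Relationship between stimuli and responses in the second task."""
    IDENTICAL = "identical"
    REVERSED = "reversed"


def predicted_transfer(relation: SRRelation, elements_altered: bool) -> str:
    """Return the transfer predicted for a second task (illustrative only).

    `elements_altered` is True when the stimuli or responses of the second
    task differ in similarity or position from those of the first task.
    """
    if relation is SRRelation.IDENTICAL:
        if elements_altered:
            return "positive, decreasing in amount"
        # Same elements and same relationships: not a row of Table I.
        return "not tabulated"
    if elements_altered:
        return "negative, decreasing in amount"   # partial reversal
    return "maximum negative"                      # complete reversal
```

For example, `predicted_transfer(SRRelation.REVERSED, False)` corresponds to complete reversal, and `predicted_transfer(SRRelation.REVERSED, True)` to partial reversal.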


DISCUSSION 


Although our analysis has for rea- 
sons of clarity been confined to dis- 
criminative learning situations in which 
the responses are accomplished by 
movements of the skeletal muscles, its 
reasoning may be applied with little 
change to the learning of verbal re- 
sponses by the paired-associate method. 
It is not our intention to emphasize a 
distinction between motor learning of 
the type we have been describing and
paired-associate verbal learning. On
the contrary, we consider them to be 
quite identical in their essential aspects. 
Not only do we feel that the same prin- 
ciples of learning apply to both, 
but the importance of self-instructional 
stimuli has been sufficiently emphasized 
to indicate that the learning of dis- 
criminative motor responses cannot be 
completely understood without a con- 
sideration of the kind of verbal reaction 
which the human subject displays when 
he learns a motor task. Space prevents 
the presentation of specific implications 
of this hypothesis to the problem of 
transfer in the learning of paired asso- 
ciates. It may be suggested, however, 
that in many studies of this problem, 
inter-task variation has brought about 
uncontrolled variations among the ele- 
ments within each task. 

Our discussion has confined itself to 
motor tasks of a relatively simple sort: 
those in which the subject is required 
to make one motor response to one 
stimulus, and a different response, using 
the same body member, to a second 
stimulus. It is recognized that a rela- 
tively small proportion of actual motor 
skills are as simple as this. In fact it is 
usually with regard to more complex 
skills that the question of transfer 
arises. Nevertheless we believe that 
our analysis forms the basis on which 
investigations of transfer in these more 
complex skills must rest. It is evident 
that the usual source of complexity in 
motor skills of this type lies in the 
increase in number of discriminative
habits which the individual is required 
to learn. Thus it is common to find 
that an operator must acquire skill not 
simply in pressing two different keys 
in response to two different lights, but 
rather in making 15 or 20 different
responses to as many different stimuli. 

The way in which learning and transfer
vary with the number of discriminative
responses of the task is an important
problem in the field of motor learning,
comparable to that which concerns
the factor of amount of material (or 
length of list) in studies of verbal learn- 
ing. There is obviously a need for 
investigation of the relationships be- 
tween variation in number of dis- 
criminative responses and difficulty of 
learning, as well as the relation between 
number of responses and transfer.
However, it is our belief that experi- 
ments which are designed to attack 
these problems will need to take into 
account the intra-task similarities of 
both stimuli and responses, as they have 
been discussed here. In a task in which 
10 different responses must be acquired 
to 10 different stimuli, any given stimu- 
lus of the total set is more or less simi- 
lar to any of the other 9 stimuli, and 
any given response is more or less 
similar to the other 9 responses. If 
one were to study transfer with such a 
task in a situation in which all 10 
stimuli were to be altered to some de- 
gree, it is our contention that the altera- 
tion must be defined in terms of changes 
in the similarity among the 10 stimuli 
of the task itself in order for mean- 
ingful relationships between alterations 
and transfer to be found. An initial hy- 
pothesis to be applied to experimental 
studies of such complex skills might 
well be that the generalization tend- 
encies of the responses to each stimulus 
summate in a simple algebraic manner 
with the gradients associated with the 
other stimuli of the total set. Also, 
conceiving of transfer as falling on a 
continuum, the positive and negative 
transfer effects predicted for the sepa- 
rate discriminative habits may be ex- 
pected to summate algebraically, allow- 
ing for prediction of the net transfer 
result of stimulus and response varia- 
tions. 
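
The initial hypothesis of algebraic summation may be sketched as follows. This Python is illustrative and rests on assumptions that are not in the paper: the exponential shape of the gradient, the `strength` and `rate` parameters, the representation of stimuli as positions on a single similarity scale, and all function names are introduced here only to make the summation concrete.

```python
import math
from typing import Dict, List


def gradient(distance: float, strength: float = 1.0, rate: float = 1.0) -> float:
    """Assumed generalization gradient: strength falls off with stimulus distance.

    The exponential form is a placeholder; the paper does not specify a shape.
    """
    return strength * math.exp(-rate * abs(distance))


def response_tendencies(trained: Dict[float, str], test_stimulus: float) -> Dict[str, float]:
    """Sum, for each response, the gradients spreading from every trained stimulus.

    `trained` maps a stimulus position on a hypothetical similarity scale to
    the response learned to that stimulus.
    """
    totals: Dict[str, float] = {}
    for position, response in trained.items():
        totals[response] = totals.get(response, 0.0) + gradient(test_stimulus - position)
    return totals


def net_transfer(effects: List[float]) -> float:
    """Algebraic sum of signed transfer effects, one per discriminative habit.

    Positive values stand for positive transfer, negative values for negative.
    """
    return sum(effects)
```

With two trained stimuli at positions 0.0 and 1.0, `response_tendencies({0.0: "left", 1.0: "right"}, 0.0)` gives the full strength to "left" and a generalized, weaker tendency to "right"; `net_transfer([0.3, -0.5, 0.1])` gives a small net negative value.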

The conflict between the conclusions 
of the present analysis and the Wylie 
hypothesis (16) has already been
pointed out. According to the present
set of principles, positive transfer is to 
be expected from changes in either the 
responses or the stimuli, or both, pro- 
vided that these changes do not involve 
reversal of the relationships of stimuli 
to responses within the task. A con- 
trast may also be noted between these 
predictions and the Skaggs-Robinson 
hypothesis (14). Whereas the latter 
predicts a low point in transfer when 
the first and second tasks are dissimilar 
to some unspecified degree, and increas- 
ing transfer beyond this point of dis- 
similarity, our hypothesis predicts that 
the amount of positive transfer will 
progressively decrease with greater de- 
grees of alteration in the similarity of 
elements within a task. In place of the 
idea that a minimum point in trans- 
fer is reached and passed as the simi- 
larity of the first and second tasks de- 
creases, the hypothesis of the present 
paper conceives the low point in trans- 
fer (maximum negative transfer) to be 
attained by the manipulation, not of 
similarity, but of a different variable— 
stimulus-response relationships. 

It is not at all evident that our hy- 
pothesis makes predictions which are 
new in a practical sense. Many indi- 
viduals are able to make perfectly good 
rough predictions of the kind of trans- 
fer to be expected from a given type of 
change in a learning task. Thus it does 
not come as a surprise to most people 
to learn that the typical condition of 
negative transfer is one of reversal, or 
that positive transfer is the usual result 
when the stimulus or response aspects 
of one task are similar to those of 
another. At the present time, though, 
few results of controlled experiments
can be adduced in support of the more 
specific implications of these hypothe- 
ses. The reason for this lack of evi- 
dence lies in the fact that experiments 
on transfer have not been carried out 
under conditions which have permitted
the control and systematic variation of
the variables which are conceived in 
our analysis to be operating in the 
transfer situation. It is our hope that 
future experiments will be designed 
specifically to test the predicted effects 
of these variables on transfer. 


SUMMARY 


An analysis is made of the implica- 
tions for transfer of alterations in the 
stimuli and in the responses of discrimi- 
native motor learning tasks. It is con- 
tended that these alterations are most 
clearly stated in terms of the similari- 
ties existing among the stimuli and 
among the responses of each task, 
rather than in terms of similarities be- 
tween tasks. Four main types of altera- 
tion are described: (a) making the 
stimuli (or responses) of the second 
task more alike than those of the first; 
(b) making the stimuli (or responses) 
of the second task less alike than those
of the first; (c) displacing the stimuli
(or responses) of the second task, relative
to those of the first task, in the
same direction along a scale of similarity;
and (d) reversing the stimulus-response
relationships of the second task
in relation to what they are in the first
task. With the use of the concept
of the gradient of generalization, con- 
ceived as applicable to both stimulus 
and response, the results of each of 
these types of alteration are related by 
hypothesis to positive and negative 
transfer effects. Certain further impli- 
cations of the hypothesis for the design 
of experimental studies of transfer in 
paired-associate verbal learning, and in 
complex motor skills of the discrimi- 
native type, are also discussed. It is 
believed that the principles arising from 
this analysis provide a useful framework 
for the statement of the relationships 
of “similarity” to transfer. 


REFERENCES 


1. Bray, C. W. Transfer of learning. J. exp. Psychol., 1928, 11, 443-467.

2. Bunch, M. E. Transfer of training in the mastery of an antagonistic habit
   after varying intervals of time. J. comp. Psychol., 1939, 28, 189-200.

3. Cook, T. W. Studies in cross-education. III. Kinesthetic learning of an
   irregular pattern. J. exp. Psychol., 1934, 17, 749-762.

4. Gibson, E. J. A systematic application of the concepts of generalization
   and differentiation to verbal learning. Psychol. Rev., 1940, 47, 196-229.

5. Gibson, E. J. Intra-list generalization as a factor in verbal learning.
   J. exp. Psychol., 1942, 30, 185-200.

6. Gulliksen, H., & Wolfle, D. L. A theory of learning and transfer: I.
   Psychometrika, 1938, 3, 127-149.

7. Hovland, C. I. The generalization of conditioned responses: I. The sensory
   generalization of conditioned responses with varying frequencies of tone.
   J. gen. Psychol., 1937, 17, 125-148.

8. Hovland, C. I. The generalization of conditioned responses: IV. The effects
   of varying amounts of reinforcement upon the degree of generalization of
   conditioned responses. J. exp. Psychol., 1937, 21, 271-276.

9. Hull, C. L. Principles of behavior. New York: Appleton-Century, 1943.

10. Hull, C. L. The problem of primary stimulus generalization. Psychol. Rev.,
    1947, 54, 120-134.

11. Hunter, W. S. Habit interference in the white rat and in human subjects.
    J. comp. Psychol., 1922, 2, 29-59.

12. Osgood, C. E. The similarity paradox in human learning: A resolution.
    Psychol. Rev., 1949, 56, 132-143.

13. von Restorff, H. Analyse von Vorgängen im Spurenfeld. I. Über die Wirkung
    von Bereichsbildungen im Spurenfeld. Psychol. Forsch., 1933, 18, 299-342.

14. Robinson, E. S. The ‘similarity’ factor in retroaction. Amer. J. Psychol.,
    1927, 39, 297-312.

15. Spence, K. W. The differential response in animals to stimuli varying
    within a single dimension. Psychol. Rev., 1937, 44, 430-444.

16. Wylie, H. H. An experimental study of transfer of response in the white
    rat. Behav. Monogr., 1919, 3, No. 16.


[MS. received July 6, 1949] 





A STIMULUS-RESPONSE ANALYSIS OF THE HOARDING 
HABIT IN THE RAT 


BY MELVIN H. MARX 


University of Missouri 


INTRODUCTION 


In spite of the early theoretical em- 
phasis on habits as “chained reflexes” 
psychologists have spent relatively little 
time in actual experimental analysis of
complex and heterogeneous learned ac- 
tivities. With scattered exceptions the 
more analytic experimental work has 
centered on fairly simple and homoge- 
neous types of responses like bar-press- 
ing and maze-running. The choice of 
such responses is of course to be largely 
explained by the greater control that 
is possible with them and the expecta- 
tion that the relationships between the 
basic variables may thus be more effectively
treated. As a consequence, however,
general synthesizing principles
have been more or less neglected. 

The hoarding habit of the laboratory 
rat seems to afford a particularly good 
opportunity for investigation of be- 
havior synthesis. It is the primary 
purpose of the present study to utilize 
the common learning principles derived 
from basic animal experimentation in 
the interpretation of this more complex 
and heterogeneous habit. The hypothe- 
sis will be advanced that the hoarding 
behavior pattern can be most ade- 
quately explained as an integrated series 
of heterogeneous learned responses. 

In the previous paper in this series 
(19) an attempt was made to place the 
hoarding behavior of the laboratory rat 
in the usual framework of the psy- 
chology of learning, and to present pre- 
liminary observational evidence in sup- 
port of the view that this behavior is 
a function of learning, rather than of 
innate factors as has been generally 
held. Before summarizing certain of
the more important aspects of that evidence
a brief orienting review will be
presented for those readers not already 
familiar with the special literature of the 
hoarding studies. 


Hoarding Studies and Previous 
Interpretations 


The strong propensity of the laboratory 
rat to carry food pellets about in its cage 
is readily observed by anyone familiar 
with the typical rat colony. Experimental 
study of this behavior tendency, however, 
has been restricted to the last ten years 
or so during which a number of studies 
of “hoarding” or “food-storing” have been 
reported. In the great majority of these 
the animal has been confined for varying 
periods of preliminary accommodation in 
a small individual living cage, usually with 
some degree of food deprivation intro- 
duced as a factor. In the hoarding test 
proper the rat is given access to an alley 
leading away from the cage and to a bin 
full of food pellets (or similar transferable 
materials). The number of pellets re- 
turned to the cage within an arbitrarily 
fixed testing period (usually thirty minutes 
daily) is recorded as a measure of the 
hoarding activity of the animal. Under 
the conditions of the usual experiment this 
behavior begins more or less slowly but 
is gradually accelerated. In extreme cases 
an animal may spend most of the thirty 
minute period scurrying back and forth 
with pellets until as many as fifty or sixty 
are accumulated in the cage. Not only 
has there been a wide variety of investi- 
gations into the more relevant variables 
presumably influencing such food-carrying 
behavior (3, 4, 5, 24, 25, 26, 41, 44, 46, 
47, 51) but the hoarding activity itself has 
been used as a dependent variable in 
studies of “frustration” (17, 22) and the 
adult effects of infantile deprivation (15, 
16, 23, 45). 







Little progress has been made towards 
an adequate theoretical interpretation of 
hoarding. The most recent attempt has 
been by Bindra (5) who postulates “shy- 
ness” and “security” as the relevant vari- 
ables. Although this appears to point in 
a promising direction it does not clearly 
state the particular factors that produce 
hoarding. By far the most popular type 
of explanation advanced has been some 
variation of the thesis that hoarding has 
an instinctive basis. This has resulted 
largely from the expressed inability of the 
investigators (e.g., 25, p. 667, and 46, p. 
53) to see how the usual rewards could 
operate to reinforce the hoarding behavior 
in well-fed rats which return pellets regu- 
larly but fail to feed on them. The most 
explicitly formulated version of this point 
of view may be found in Morgan’s paper, 
“The Hoarding Instinct” (30), but it is 
clearly evident in other reports and ap- 
pears to be the orthodox view of the na- 
turalistic type of observer. Although the 
manifest non-explanatory nature of such 
an instinct hypothesis need not be labored, 
Morgan’s own excellent comment, from 
another context, may be usefully cited: 
“The principle most often followed in prac- 
tice, it seems, is that of calling any be- 
havior instinctive which is somewhat com- 
plicated and whose mechanisms are not 
understood by the observer. Instinctive 
behavior, therefore, gets defined in terms 
of the ignorance of the observer” (29, 
p. 396). 


Observational Basis of Present 
Interpretation 


In the previous paper (19) data were 
presented from a number of simple, ortho- 
dox hoarding experiments utilizing food 
pellets, wooden blocks, and milk pellets 
(cotton dental plugs saturated with fresh 
milk). The data, appropriately analyzed, 
gave evidence not only of learning but also 
of generalization, discrimination, extinc- 
tion, resistance to extinction, and spon- 
taneous recovery. Negatively accelerated 
learning and extinction curves were found, 
despite the complications introduced by 
the arbitrary design of the typical hoarding
experiment (such as the fixed thirty
minute test period with no control of
important variables like amount of feeding 
in the cage following the return of each 
pellet, the number of pellets returned per 
daily period, etc.). Moreover, careful ob- 
servations and objective recordings of the 
time and extent of the first movements of 
the naive rat into the alley gave striking 
evidence of a distinctly hierarchal development
of the early exploratory (and pre-hoarding)
behavior. The animals were
observed to make a large number of pre- 
liminary, partial entrances into the alley 
before reaching the bin end, and typically 
to make one or more excursions to the 
bin before returning with the first pellet. 
A certain amount of direct feeding on the
pellet (and presumably corresponding
reinforcement) was also observed to occur
following the first few pellet returns. The
subjective impressions of the observers 
that the onset of hoarding was an active 
learning process were supported by the 
objective data. Failure of previous in- 
vestigators to find similar empirical evi- 
dence for the primary role of learning 
was suggested to be a function of their 
failure to make adequate efforts to find it, 
and particularly of their failure to make 
the relevant observations of the pre-hoard- 
ing and early hoarding phases of the 
behavior. 


THEORETICAL INTERPRETATION 


A major postulate of this paper is 
that the typical hoarding habit may be 
primarily explained as the result of the 
fusion of independently acquired habits. 
These will first be described separately, 
after which consideration will be given 
to the problem of their integration. 


1. Pre-experimental Responses to 
Food Pellets 


In all the hoarding experiments thus 
far reported, with the exception of a 
part of the original one by Wolfe (51), 
the rats had apparently been main- 
tained on food pellets at some time 
prior to their introduction to the hoard- 
ing situation. Although this condition 
has been recognized as a possible factor
in the later development of hoarding
behavior (51, 46), no attempt has been
made to treat it in a systematic or 
analytic manner. In the present analy- 
sis three salient aspects of the rat’s 
pre-hoarding responses to the food 
pellet as a stimulus have been differen- 
tiated and will be considered in turn. 

a. Pellet seizure. The pellet-seizure 
response is prominent in the behavior 
repertoire of the laboratory rat under 
normal living conditions, and is par- 
ticularly marked in rats living in groups. 
Typically, upon the introduction of 
new food pellets or the re-discovery of 
old ones, the rat may be observed to 
grasp the pellet in its mouth and carry 
it about rather than to start to feed 
upon it at once. As the first component 
in the usual cage eating sequence, 
therefore, the pellet-seizure response 
may be expected to be strongly rein- 
forced by the subsequent drive reduc- 
tion associated with actual feeding be- 
havior. Moreover, it may also be 
expected to be reinforced, more directly, 
by the response-produced stimulation 
that invariably follows jaw muscle con- 
tractions, including those which are 
actually involved in direct feeding. A 
similar kind of immediate secondary 
reinforcement has been suggested by 
Spence (43) and experimentally indi- 
cated by Grice (11) in their treatment 
of the problem of delayed reward in 
visual discrimination learning in the 
rat. 


There is also the possibility of some 
kind of direct sensory “reinforcement” 
from the (mainly) olfactory stimuli pro- 
duced by the food pellet, as suggested by 
Young (52) in his recent theoretical inter- 
pretation of the work on food preference 
and palatability. Bash’s (1, 2) work on 
responses to food in rats with neurophysiologically
isolated intestinal tracts offers
further indirect support for this supposition,
although it is also susceptible of explanation
in accordance with the principles
of secondary reinforcement. It is this sort
of “reinforcement” which may be respon- 
sible for the primary approach responses 
of animals to certain pre-potent types of 
stimulation. In so far as it is based on 
inherited sensory structures and is, presum- 
ably, independent of previous experience, 
such a response in its early appearance 
may be said to be innately determined. 
Subsequent activation of it, however, and 
especially subsequent modification and in- 
tegration with other responses, are clearly 
functions of learning. 


Additional sources of reinforcement 
for the pellet-seizure response will be 
suggested in the following discussion. 

b. Pellet carrying. After seizing a 
food pellet the rat typically carries it 
about the cage. Such running re- 
sponses, with food pellets grasped in the 
jaws, are particularly evident in rats 
caged in groups. But they are also 
found even in the individually caged 
rat, which tends to seize the pellet 
whether offered by hand or dropped into 
the cage, and carry it to a back part 
of the cage. So strong is this tendency, 
as a matter of fact, that it is regularly 
observed to occur, although with re- 
duced intensity, even in rats on unlim- 
ited feeding schedule and at times when 
a pellet is routinely selected from an 
ample supply of pellets which have been 
in the cage for some hours. 

The running response following pellet- 
seizure may be interpreted as having 
been originally made to the frightening 
stimuli associated with the providing of 
the pellets (noise, sight of the feeder’s 
hand and body, etc.) or, in the social 
situation, with the active competition 
for the pellets by other animals. A re- 
cent report by Calhoun (7) suggests the 
important role of competition in the 
securing of food by trapped wild rats 
living in a one hundred foot square pen 
with a central food source. Vigorous 
attempts by one rat to take a pellet 
out of another’s jaws are clearly demonstrated
in a film by Mowrer (33). In
accordance with orthodox learning prin- 
ciples the stimulus situation associated 
with the arousal of such emotional re- 
actions may be expected to re-arouse 
them—in the milder form of “anxiety” 
—upon subsequent occasions (32, 34, 
35, 36, 20, 27, 9). Thus the running 
response would tend to be conditioned 
to the kinesthetic stimulation produced 
by the pellet seizure. 

Empirically the rat is observed to 
persist in its running activity with the 
pellet until a relatively “safe” place has 
been reached, that is, until the fear- 
arousing and action-inducing cues are 
sufficiently reduced. The consequent 
anxiety-reduction serves to reinforce 
the earlier pellet-seizure response as 
well as the running response, in the 
behavior sequence initiated by pellet 
recognition, and may continue to do so 
even after the originally effective threat- 
ening stimuli have become inoperative 
(20, 27, 9). 

c. Pellet release. In animals having 
previous experience with food pellets 
the pellet-release response, like the 
pellet-seizure and pellet-carrying re- 
sponses, is acquired prior to the hoard- 
ing test. It is obviously impossible for 
a rat to feed on a pellet as long as the 
pellet is firmly held by the jaws. Re- 
lease of the pellet is therefore a neces- 
sary component of the usual behavioral 
sequence in normal cage feeding. In 
a hungry animal this occurs quickly 
after a position of relative seclusion has 
been reached. Nibbling upon the pellet 
is usually either carried out directly 
while it rests on the cage floor or with 
the aid of the forepaws after it has been 
reduced to smaller pieces. The act of 
releasing the pellet may thus be consid- 
ered to be conditioned to the proprio- 
ceptive stimulation associated with the 
cessation of movement and to be reinforced
by the more or less immediately
occurring direct feeding with resultant
drive reduction.

In summary: Prior to the first hoard- 
ing test the experimentally naive rat has 
acquired a strongly reinforced sequence 
of responses initiated by pellet recogni- 
tion and involving the seizure, carrying 
and eventual release of the food pellet. 
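
The pre-experimental sequence summarized above can be laid out as a simple data structure. The Python below is only a descriptive sketch: the class name `ChainLink`, its fields, and the helper `describe` are hypothetical conveniences introduced here, while the cue and reinforcement entries paraphrase the account given in sections a, b, and c.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ChainLink:
    """One component response in the assumed pellet-handling sequence."""
    response: str
    cue: str             # stimulation to which the response is conditioned
    reinforcement: str   # suggested source of reinforcement


PRE_HOARDING_CHAIN: List[ChainLink] = [
    ChainLink(
        response="pellet seizure",
        cue="pellet recognition",
        reinforcement="drive reduction from subsequent feeding; "
                      "response-produced stimulation from jaw contractions",
    ),
    ChainLink(
        response="pellet carrying",
        cue="kinesthetic stimulation produced by pellet seizure",
        reinforcement="anxiety reduction on reaching a relatively safe place",
    ),
    ChainLink(
        response="pellet release",
        cue="proprioceptive stimulation from cessation of movement",
        reinforcement="direct feeding with resultant drive reduction",
    ),
]


def describe(chain: List[ChainLink]) -> str:
    """Join the component responses in order of occurrence."""
    return " -> ".join(link.response for link in chain)
```

Calling `describe(PRE_HOARDING_CHAIN)` lists the three responses in the order in which they are described in the text.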


2. Pre-hoarding “Homing” Habit 


The observations previously reported 
(19) gave the unmistakable impression 
that in the course of its preliminary 
explorations in the alley the naive rat 
was forming a strong tendency to leave 
and return into the cage. While the 
rat was in the alley, particularly in the 
early training stages, any unusual stimu- 
lus was likely to result in a quick return 
into the cage. The alley excursions 
were observed gradually to lengthen 
until the rat reached the bin area, soon 
after which it generally began to return 
with pellets. 

The rapid development of this habit 
of leaving and returning to the cage, 
clearly evident before the actual onset 
of hoarding, may be simply but ade- 
quately explained upon the basis of the 
secondary reinforcing properties of the 
various stimuli produced by the cage 
(mainly olfactory and visual) and pre- 
viously associated with such primary 
need reduction as has been effected by 
regular eating and drinking in the cage. 
In earlier hoarding studies (26, 47) it 
has been shown that “familiarity” with 
the cage, mainly dependent upon olfac- 
tory and visual factors, is a most impor- 
tant determinant of hoarding, which is 
not easily induced in rats newly placed 
in strange cages. 

With the assumption of such second- 
ary reinforcing properties it is not dif- 
ficult to understand why sudden exter- 
nal stimuli should send the freely ex- 
ploring rat quickly back into the cage, 
as was consistently observed in our ex- 
periments. In the absence of any obvi- 
ous strong external stimulation the mild 
but mounting “tensions” accumulating 
from the influx of new stimuli in the 
alley would eventually also send the 
animal in the direction of familiar, 
anxiety-reducing cues—thus resulting 
in an apparently “spontaneous” return 
to the cage. 

Bindra’s recent report (5) of greatly 
accelerated hoarding activity following 
the substitution of “open” alleys for 
“closed” ones is especially suggestive in 
this connection. He reports that in the 
“open” situation the animal also took 
considerably more time to enter the al- 
leys. Although further behavioral de- 
tails were not reported in this study it 
may be supposed, on the basis of our 
own observations, that the increment 
in hoarding followed the strengthened 
tendency to return to the cage, once the 
rat had entered the alley. A similar
interpretation may be made of the 
greatly accelerated hoarding induced by 
exceptionally strong lighting of the al- 
ley—but not of the cage—in an unpub- 
lished study by Stellar (45; also cited 
by Morgan in 30). 

In summary: Even before it has 
reached the bin end of the alley, and 
thus prior to the onset of hoarding, the 
naive rat in the typical hoarding ex- 
periment has developed the habit of 
leaving and returning to the home cage. 
It should be emphasized that this ob- 
servation is empirically grounded and 
independent of any particular theoreti- 
cal interpretation, although the one 
given above appears to be satisfactory. 


Note on exploratory behavior. Although 
for the purpose of this analysis the uni- 
versally observed exploratory tendency of 
the rat may be simply assumed, it is never- 
theless an interesting and theoretically im- 
portant problem. In the present situation 
the particularly important role of direct 
olfactory cues is suggested. Unless the 
alleys have been thoroughly washed fol- 
lowing each hoarding test—and this has 
not been mentioned in the reports—a cer- 
tain amount of exploration may be ac- 
counted for on the basis of residual, cu- 
mulative food and animal odors. The 
secondary reinforcement resulting from 
such stimulation may thus be suggested as 
a possibly significant factor in the appear- 
ance of exploratory behavior. Visual cues 
from the bin end of the alley may also be 
mentioned as possible factors, but in view 
of the relatively poor visual abilities of 
the rat it is not likely that they are impor- 
tant in this respect. 


3. Integration of the Hoarding Habit 


The problem of the fusion of these 
independently learned habits into the 
integrated hoarding habit is resolved by 
assumption of a dual stimulating and 
reinforcing role for the food pellet. 
The following discussion outlines the 
main relationships. 

a. First hoarding activity. When the 
exploring rat finds the binful of food 
pellets at the end of the alley it behaves 
entirely in accordance with previous 
habit formation. One of the pellets is 
seized, more or less promptly, depending 
upon the current drive state of the ani- 
mal, and carried quickly back to the 
cage. Certainly no new explanation is 
needed for this response. The pre- 
experimental habit of carrying the food 
pellet about in the cage is reactivated 
under new anxiety conditions. The 
animal moves in the direction of anxiety- 
reducing cues, as it has learned to do 
previously both within and without the 
cage, and the result is that the pellet is 
returned to the cage. Release of the 
pellet follows the cessation of move- 
ment, as previously, and the first pellet- 
return response is completed. 

Following the first two or three re- 
leases of the pellet in the cage the rats 
in our experiments were typically ob- 
served to spend at least a small amount 
of time in active nibbling, again in ac- 
cordance with their previous habit for- 
mation. The extent of this feeding 

largely depended upon the deprivation 
condition of the animal. In his early 
study Hunt (15) noted that there ap- 
peared to be a conflict between eating 
and hoarding responses. For this rea- 
son it has been customary to allow par- 
tial satiation before the hoarding test 
proper, in order to accelerate the onset 
of hoarding behavior. Although the 
theoretical significance of this early 
feeding behavior does not appear to 
have been recognized, it may be ex- 
pected to have a direct reinforcing in- 
fluence on the preceding sequence of 
responses, in the same manner as it pre- 
viously did within the cage. This sug- 
gestion is supported by the observations 
made previously (19) on hungry rats 
which quickly shifted their feeding posi- 
tions to the alley when the pellet was 
removed immediately after its return 
into the cage. 

Once the first return with a pellet has 
been completed and its immediate feed- 
ing motives satisfied, the rat tends to 
re-enter the alley and return to the 
bin at regularly decreasing time inter- 
vals (19). The role of the food pellet 
as a secondary reinforcer now becomes 
apparent. The strong olfactory and 
visual stimuli produced by the food 
pellets may be suggested as the most 
probable factors in the reinforcement 
of the bin-return component of the 
hoarding habit. It is also possible that 
the proprioceptive stimulation resulting 
from pellet-seizure exerts some influence
in this respect, in addition to reinforcing
more directly the pellet-seizure
response itself.

A small amount of nibbling, as well 
as sniffing, of the food pellets has been 
observed at the bin. Rapid return to 
the bin under such conditions would be 
easily explained on the basis of re- 
inforcement principles, and the situ- 
ation would be somewhat similar to that 
in the Columbia Obstruction method of 
measuring drive. The main difference 
is that whereas in the latter situation 
the animal is arbitrarily returned to the 
starting box by the experimenter, in 
the hoarding test it returns because of 
the strong habits previously established. 

b. Subsequent development of hoard- 
ing. The typical negatively accelerated 
shape of the learning curves in our pre- 
vious hoarding study (19, Figs. 2, 3, 4) 
may be explained upon the basis of the 
accumulation of reinforcement under 
the new conditions to which the rat was 
exposed. Successive reinforcements of 
the bin-return response would be ex- 
pected to result in an increasing tend- 
ency to make this response to the 
stimulus cues produced by the open 
alley. Successive reinforcements of the 
cage-return response would similarly 
operate to strengthen that tendency.
The empirical evidence from many 
other hoarding studies (cf. 4, 25, 26, 31, 
46, 47) is thoroughly in accord with 
these expectations. The rat that has 
begun to hoard generally increases the 
vigor and speed of its food-carrying 
activity until in extreme cases a re- 
markable amount of work may be ac- 
complished. For example, we have ob- 
served one female rat which returned a 
peak total of 97 pellets over the forty- 
inch runway in thirty minutes—an 
average of more than one complete trip 
every twenty seconds. Similar exam- 
ples of excessive hoarding activity have 
been reported elsewhere (e.g., 25, p. 
666, footnote). There can be no doubt 
that the bin and its contents exert a 
strong influence upon the behavior of 
the rat which has had the relevant pre- 
liminary and early hoarding experiences. 
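
The rate quoted for this animal can be checked with a short calculation. The following Python sketch is an editorial illustration, not part of the original observations. It assumes that each returned pellet corresponds to one complete round trip over the full forty-inch runway; the function names are hypothetical.

```python
# Figures taken from the text: 97 pellets, forty-inch runway, thirty minutes.
PELLETS = 97
RUNWAY_INCHES = 40
TEST_MINUTES = 30


def seconds_per_trip(pellets: int, minutes: float) -> float:
    """Average time per complete trip, assuming one pellet per trip."""
    return minutes * 60 / pellets


def total_distance_feet(pellets: int, runway_inches: float) -> float:
    """Distance covered, assuming every trip runs the full alley out and back."""
    return pellets * 2 * runway_inches / 12


if __name__ == "__main__":
    print(f"{seconds_per_trip(PELLETS, TEST_MINUTES):.1f} s per trip")
    print(f"{total_distance_feet(PELLETS, RUNWAY_INCHES):.0f} ft in total")
```

Under these assumptions the average interval comes to roughly 18.6 seconds, which agrees with the statement of more than one complete trip every twenty seconds.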

The subsequent decrement in hoard- 
ing behavior is readily explicable on 
the basis of a progressive accumulation 
of inhibitory factors associated with 
the effortfulness of the activity (42). 
Negatively accelerated extinction curves 
were found in our previous study (19, 

Figs. 5 and 6) and other earlier work 
(26, 31, 47). 
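
The term "negatively accelerated" used for both the learning and the extinction curves can be made concrete with a toy calculation. The Python sketch below is an editorial illustration only. It assumes a simple rule in which each trial changes response strength by a fixed fraction of the distance remaining to a limit; the rule, the rate of 0.3, and the function names are hypothetical and are not taken from the data of the studies cited.

```python
def acquisition_curve(trials: int, ceiling: float = 1.0, rate: float = 0.3) -> list[float]:
    """Strength after each reinforced trial; each gain is a fixed fraction of the remaining gap."""
    strength = 0.0
    curve = []
    for _ in range(trials):
        strength += rate * (ceiling - strength)
        curve.append(strength)
    return curve


def extinction_curve(trials: int, start: float = 1.0, rate: float = 0.3) -> list[float]:
    """Strength after each unreinforced trial; each loss is a fixed fraction of what remains."""
    strength = start
    curve = []
    for _ in range(trials):
        strength -= rate * strength
        curve.append(strength)
    return curve


if __name__ == "__main__":
    for trial, (up, down) in enumerate(zip(acquisition_curve(8), extinction_curve(8)), start=1):
        print(f"trial {trial}: acquisition {up:.3f}, extinction {down:.3f}")
```

In both directions the change per trial shrinks as trials accumulate, which is all that "negatively accelerated" asserts; the particular rate chosen affects only how quickly the curve flattens.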

c. Role of food deprivation. The 
development of hoarding behavior may 
be interpreted, in accordance with the 
preceding discussion, as a result of a 
certain balance between the feeding 
and the anxiety-reducing motives. So 
far as feeding motivation is concerned, 
the optimal condition appears to be one 
in which recent deprivation has been 
followed by partial satiation. As sug- 
gested above, this important point has 
been appreciated in a practical, if not 
in a theoretical, sense. 

The more rapid development of 
strong hoarding habits in recently de- 
prived rats is probably in part a func- 
tion of their higher activity level with 
consequent acceleration of the prelimi- 
nary exploratory behavior and develop- 
ment of the cage-return habit. An ad- 
ditional factor favoring the more rapid 
development of hoarding behavior in 
these animals would be the differential 
activation of the jaw-manipulating re- 
sponses that have been conditioned to 
the sight and smell of the food pellets. 
Data from the observations previously 
reported (19) support this interpreta- 
tion in that the non-deprived animals 
made over three times as many pellet- 
less trips to the bin preceding their first 
pellet return as did the recently de- 
prived animals. 

Enhancement of the stimulating 
properties of the food pellet through 
“frustration” is also suggested by the 
experiment of McCord (22), in which 
hungry animals were permitted to see 
and reach for small food pellets kept 
out of reach behind a wire mesh bar- 
rier. Such animals subsequently seemed 
to remove a greater number of pellets 
from a clip in the side of the cage than 
the controls, equally under-fed but not 
subjected to the additional “frustra- 
tion.” A similar result seems to have 
been obtained by Wolfe (51) in young 
rats which had been placed for a period 
of twenty days on an “inadequate” 
diet—one sufficient merely to maintain 
their body weight at a time when it 
would normally have been increasing. 

It may also be pointed out that al- 
though operation of the secondary re- 
inforcing factors of the kind postulated 
above is dependent upon previous acti- 
vation of response systems by relatively 
high drive states there is no require- 
ment that such states be currently ac- 
tive. Experimental demonstrations of 
this point have been provided by Brog- 
den (6), and by Saltzman and Koch 
(38). Brogden’s results seem particu- 
larly pertinent to the present situation 
in that completely satiated dogs con- 
tinued to make the instrumental re- 
sponse which brought them food, in the 
conditioning situation, even though they 
then refused to eat the food. 

d. Special conditions. In Wolfe’s 
(51) study some rats were raised on 
powdered food, and were significantly 
retarded in hoarding behavior on the 
first test day. The appreciable incre- 
ment in hoarding shown on the subse- 
quent test days may be explained on 
the basis of a relatively rapid learning 
of responses to the food pellet. Ap- 
proach and manipulatory responses 
would be expected on even the first 
encounter of such rats with a food pel- 
let. They would result from the gen- 
eralization to be expected on the basis 
of the olfactory and visual (texture) 
similarity between the food pellets and 
the powdered food, and would account 
for the slight amount of first day 
hoarding which Wolfe found. Subse- 
quent reinforcement of the kind postu- 
lated above would explain the further 
development and integration of the 
hoarding habit. 

It should also be mentioned that the 
present analysis, while directly appli- 
cable to the typical hoarding situation, 
would need to be modified to fit special 

circumstances. For example, when the 
pellets are placed within the home cage, 
as has been done in certain of the early 
studies on “frustration” (17, 22), obvi- 
ously the “homing” habit would not be 
involved. Learned pellet-seizure, pellet- 
carrying, and pellet-release tendencies 
would be necessary, however, in treat- 
ing such cases. In certain other hoard- 
ing studies (25, 51) in which the dis- 
tance to the food bin was considerably 
shortened, it is probable that the pre- 
hoarding exploratory behavior was much 
reduced and that pellet returns followed 
more quickly upon the first alley entrances.
These cases may be considered
as simplifications, in this respect,
of the typical hoarding situation, rather 
than as exceptions to the basic pattern. 
The use of longer alleys seems to have 
facilitated our observations by throw- 
ing into sharper relief the behavioral 
origins of the activity. 


4. Instigation and Reinforcement of the 
Fully Developed Habit 


If the hoarding habit is formed from 
a fusion of previously established habits 
the question of its subsequent instiga- 
tion and reinforcement arises. Data 
previously presented (19, Fig. 6) have 
suggested the remarkable persistence 
of this habit over a twelve week test- 
ing period in animals on unlimited 
feeding and thus, presumably, minimal 
hunger drive. In view of the high 
effortfulness of the activity its strong 
resistance to complete extinction suggests
the operation of correspondingly
strong instigating and reinforcing
agents. Three specific sources of these 
may be suggested. 

a. Regular reinforcement from feed- 
ing responses. The fact that the ani- 
mals necessarily continue to make char- 
acteristic feeding responses suggests 
that the secondary reinforcement from 
the response-produced stimulation ac- 
companying jaw movements is periodi- 
cally renewed and thus effectively main- 
tained. Experimental demonstration of 
the way in which interpolated periodic 
association with primary reinforcement 
upholds the motivating power of a sec- 
ondary reinforcer has recently been of- 
fered by Saltzman (37). In the hoard- 
ing situation such periodic interpolation 
of primary reinforcement serves to 
maintain not only the originally inde- 
pendent pellet-seizure tendency but also 
the secondary reinforcing powers asso- 
ciated with the sight and smell of the 
food pellets, thus helping to account for 
the continued instigation of the alley 
runs in spite of their high degree of 
effortfulness. 

b. Spontaneous running activity. The 
kind of physical restrictions imposed 
by the small individual living cages 
typically employed in hoarding studies 
can be expected to encourage the uni- 
versally observed tendency of the nor- 
mal animal towards a certain amount 
of spontaneous activity. The physical 
dimensions of these living cages closely 
resemble those of the cages commonly 
attached to the rotating drum type of 
apparatus for measuring general run- 
ning activity. The implication of this 
similarity is that the same basic factors 
that motivate the normal, well-fed animal 
toward a certain amount of spontaneous 
activity, in the rotating drum apparatus, 
may well be expected to operate in the 
hoarding cage. The running activity 
in this case of course would take place 
in the open alley, and would thus indi- 
rectly encourage the persistence of the 
food carrying behavior. The finding 
of maximal, or near-maximal, activity 
in rats following twenty-four hours of 
enforced inactivity (39, 40) helps to 
strengthen this interpretation in view 
of a similar schedule in the usual hoard- 
ing experiment in which twenty-three 
hour periods of rest are regularly inter- 
polated in between the daily half-hour 
tests. 

c. Drive generalization. Miller (28) 
has recently presented a theoretical and 
experimental analysis of the concept of 
drive generalization, according to which 
habits originally developed and re- 
inforced under one particular set of 
drive conditions may be subsequently 
activated by a different set of drive 
conditions. Hull’s (14) postulate 7 
deals directly with this phenomenon, 
and related experiments besides Mil- 
ler’s own experimental work may be 
found in reports by Estes (8) and Webb 
(50). The implications for the present 
problem are clear. Food-carrying hab- 
its learned under some degree of hunger 
drive and food reinforcement may sub- 
sequently be activated by drive condi- 
tions which are, superficially at least, 
quite different. The major possibilities
in the present problem appear to
be thirst and sex.¹

Evidence for the influence of thirst 
may be seen in the incidental observa- 
tion of Hunt (15) concerning facilita- 
tion of food-hoarding in an animal ac- 
cidentally deprived of water, in an ex- 
periment mentioned by Hunt (15, p. 
353, footnote) but as yet unpublished, 
and in the later experiment of Bindra 
(3) in which food pellets were often 
returned by thirsty but well-fed animals. 

The results of the previous study in 
this analysis (19) suggested the role of 
the sex drive in that consistently more 
hoarding was done by female rats, with 
their four to five day estrus cycle and 
associated periodic spurts of general 
activity (41, 48). In the experiment 
providing direct sex comparisons, the 
females regularly returned from two to 
three times as many pellets as the males. 
It may also be noted that the six rats 
selected as exceptionally active hoarders
were females. Their persistent
hoarding over a twelve week period of
daily tests represented a striking resistance
to complete extinction (19, Fig.
6). Comparable data are not available
for males, whose hoarding scores were
consistently lower.

1 The possibility may also be mentioned of
including the skin conditions associated with
low temperature in this classification (cf.
McCleary and Morgan, 21).

The sex difference does not appear 
to have been stressed in the literature 
but some relevant data may also be 
gleaned from other studies.² Wolfe’s
original report (51) suggests a higher 
rate of food-carrying in female rats. 
Morgan, Stellar and Johnson (31) re- 
port a quicker onset and slightly more 
hoarding in females. A similar result 
was found in another experiment in our 
laboratory (23). A study by Miller 
and Postman (25) also may be ana- 
lyzed for relevant supporting data. 

With such periodic drive intensification
associated with the estrus cycle it
is not surprising that the specifically 
directed food-carrying activity of female 
rats develops to so high a level and is 
so markedly resistant to extinction. 


DISCUSSION 
Biological Implications 


From a general biological point of 
view the major theoretical significance 
of the present interpretation is that 
it suggests the basically non-purposive 
and learned character of a fairly com- 
plex pattern of behavior which is com- 
monly found in many species under 
natural conditions, which has long been 
regarded as essentially instinctive in 
character, and which has obvious sur- 
vival value in evolution. 

2 The single study clearly showing an oppo- 
site tendency, that of Hunt and Willoughby 
(17), does not bear directly on this sex dis- 
tinction. This follows from the fact that the 
males, which hoarded almost three times as 
many pellets as the females, were fully grown 
adults (220 days of age) while the females 
were only 55 days of age. Since ovulation 
in the rat normally does not begin until 
about 77 days (18), estrual functions were 
obviously of very doubtful importance in 
this experiment. 

In consideration of the evidence for 
this statement it should be remembered 
that there is an even greater need in 
the field observation than in the labora- 
tory experiment for careful considera- 
tion of the origin of a behavior pattern. 
The greater difficulty involved in ar- 
ranging the appropriate observations in 
the former case is obviously no excuse 
for making generalizations based
upon the adult, fully developed be- 
havior. This general tendency of the 
naturalistic observer may be regarded
as one important factor in his accept- 
ance of innate determinants in hoard- 
ing behavior. It also constitutes one 
important reason for the value of sup- 
plementary laboratory experiments, 
specifically designed to delineate the 
critical variables. Such factors may 
then be assumed to operate also in the 
field situation to the extent that the 
basic conditions are correlated. 

It is suggested that the basic factors 
are so correlated in the present phe- 
nomenon. In the case of the Pine squir- 
rel, for example, the nests and certain 
nearby regions in and around the home 
trees are represented by the home cages 
of the laboratory rat; the surrounding 
grounds, beneath and around the home 
trees, are represented by the alleys in 
the laboratory apparatus; and the con- 
centration of pine cones beneath the 
home tree, and particularly beneath 
other less familiar trees, is represented 
by the concentration of food pellets in 
the bin area of the alley. There is in 
the early life of the squirrel abundant 
opportunity for learning to leave and
quickly return to the nest, particularly 
in response to unusual stimuli, with re- 
sultant secondary reinforcement of the 
sort postulated in the present analysis. 
There is also abundant opportunity for 
the squirrel, with its characteristic body 
structure and feeding habit repertoire, 
normally to learn to react to edible
cones in basically the same way as the 
laboratory rat learns to react to the 
food pellets it finds in its environment. 

Under these conditions it is not sur- 
prising that the food-carrying habit 
should develop so easily in the Pine 
squirrel, with the resultant classification 
of it by the layman and the naturalist 
as “hoarding”—often implying the de- 
liberate storing of food materials. But 
it seems more reasonable to suppose 
that the squirrel, like the rat, is oper- 
ating on a simpler set of principles and 
is merely responding to food materials 
in its characteristically learned manner 
where and when it happens to find them. 
The result is that a large number of 
edible materials are accumulated at 
various points of strongest familiarity 
to the squirrel, in and around the trees, 
and at a time of the year shortly pre- 
ceding that when they will be most 
needed. The fact that this particular 
habit has very great utility to the ani- 
mals showing it undoubtedly helps to 
explain the success of these species in 
evolutionary survival but bears no more 
direct relationship to its ontogenetic or 
phylogenetic origin. 

A similar interpretation might be 
made for many if not all of the other 
hoarding animals. One may mention 
the universality of regular feeding hab- 
its involving jaw manipulations, the 
near universality of home regions—and 
consequent “homing” responses—in 
practically all terrestrial animals, and
the prominence of contributory factors 
like periodic hunger and startling
(mildly anxiety-arousing) stimuli. The 
basic factors are thus available for early 
fusion into food-carrying habits with 
subsequent reinforcement. Their high 
frequency of occurrence helps to explain 
the prevalence of varying degrees of 
food-storing behavior, and its unique 
development in animals with especially 
well adapted basic body structures and 
habits (such as the pack rat, in which 

a high degree of generalization appar- 
ently occurs). 

Why closely related forms (e.g., cer- 
tain of the common squirrels) do not 
show significant hoarding activity will 
need to be explained in terms of the 
particular behavioral background of 
each form. Special tendencies, such as 
the burying of food objects, would com- 
plicate but would not necessarily change 
the basic relationships. They would 
also need to be explained in accord- 
ance with corresponding factors in the 
animal’s body structure and habit
repertoire. 


Psychological Implications 


1. The present analysis may be con- 
sidered as an additional, specific demon- 
stration of the fundamental role played 
by anxiety and anxiety-reduction in the 
development and integration of behav- 
ior. The occurrence of threatening 
stimulation and consequent anxiety 
arousal and reduction appears to be 
closely knit into the formation of some 
of the most fundamental and useful 
habits. Its normal importance in ani- 
mal behavior, without the experimental 
use of electric shock, is confirmed. 

2. The fact that the present inter- 
pretation has been necessarily analytic 
in its treatment of the hoarding habit 
should not obscure the essential conti- 
nuity of the actual behavior. The em- 
pirically observed continuity, however, 
may be tentatively interpreted as a 
function primarily of the simultaneous 
stimulating and reinforcing properties 
of certain stimuli, such as the food pel- 
let. Postulation of such complex inter- 
relationships and synthesizing principles 
appears to be necessary if the empiri- 
cally observed facts of behavior are to 
be adequately explained in terms of the 
basic relationships derived from rela- 
tively simpler situations. 

3. In the face of certain types of ob- 
jections to analysis and “elementarism” 
it may be emphasized that postulation 
of basically mechanical—that is, law- 
ful—operation of learning functions 
does not necessarily require that be- 
havior be “blind” or “stupid.” In the 
present analysis evidence has been presented
(19) for the flexibility of behavior
in accordance with changing conditions
under lawful—and, therefore, predictable—relationships.
A pertinent
example would be the progressive shift 
in feeding position shown by the milk- 
hoarding rats as they learned to carry 
milk pellets decreasing distances from 
the bin. 

Lawful operation of basic functions 
has been impressively demonstrated in 
Harlow’s (12) recent paper giving evi- 
dence for the gradual learning of in- 
sight in primates. That particular 
study also seems to offer a particularly 
effective answer to those theorists like 
Hilgard (13) and Waters (49) who 
have recently attacked Lloyd Morgan’s 
Canon—with its simple scientific de- 
mand for parsimony in explanation—as 
an obstacle in the interpretation of ani- 
mal behavior. The absence of adequate 
data to support suspected complex 
functions should certainly be regarded 
as more of a challenge to the experi- 
menter than to the principle of 
parsimony. 

4. Hoarding may be tentatively clas- 
sified as one of a large group of self- 
sustaining activities whose persistence 
apparently needs to be explained in 
terms of other motivational relation- 
ships than those originally operative 
(cf. 10). In complex habits which have 
been formed from other simpler and 
independently established habits, a cer- 
tain amount of the instigation and re- 
inforcement that accounts for their 
persistence may be mediated through 
the fused components, in the manner 
suggested above for the hoarding habit. 
That is, the apparent absence of direct 
motivational sources may be largely due 

to a failure to recognize the continuing 
instigation and reinforcement provided 
by the component habit systems. Ade- 
quate analysis of such complexly inte- 
grated habits may help to explain their 
superficially “autonomous” persistence. 

5. With respect to the vastly more 
complex matter of human hoarding it 
is tempting to make brief and cautious 
comment. In spite of all the obvious 
behavioral manifestations of cultural 
accretions and symbolic functions it is 
nevertheless conceivable that the basic 
pattern for at least some of the activi- 
ties classed as “hoarding” is the same 
as that described above for the food- 
carrying rat. There would be, to take 
the most common example of money, 
the same early and persistent reinforce- 
ment of the various money-acquiring 
responses. There would also be the 
relatively slow extinction of these re- 
sponses, for long after the original needs 
have been satisfied. And there would 
be the many new and potent types of in- 
stigation and reinforcement—many of 
them purely symbolic in nature—to 
continue to motivate the response sys- 
tems. Under such circumstances it is 
not surprising that certain human habits 
of acquisition should show so marked 
a development beyond the requirements 
of original needs and so strong a resist- 
ance to extinction. 


SUMMARY AND CONCLUSIONS 


A theoretical interpretation is offered
of the preliminary empirical observa- 
tions reported in a previous paper (19) 
on the origin and persistence of the 
hoarding habit in the laboratory rat. 
Hoarding behavior is interpreted as a 
complex habit formed primarily from 
the fusion of the earlier and independently
learned “homing” and pellet-seizure
habits. Previous failures to
emphasize the learned basis of this 
behavior pattern are considered to be 
primarily due to the ignoring of the 
early, pre-hoarding phases of the beha- 
vior and the consequent failure to make 
the relevant empirical observations. 

Detailed explanations are suggested 
for the synthesis of the component hab- 
its and the origin and persistence of 
the more complex hoarding habit. The 
general concepts of secondary reinforce- 
ment, anxiety-reduction, and drive gen- 
eralization are applied. 

Further theoretical implications of 
the present analysis are discussed.

The present analysis is of interest in 
that it specifically demonstrates the 
applicability of experimentally derived 
learning principles, with experimentally 
testable consequences, to more complex 
behavioral phenomena. Further experi- 
ments explicitly designed to test certain 
of the more important implications of 
the present analysis are in progress. It 
is expected that after empirical and 
theoretical isolation of the critical vari- 
ables, and their experimental confir- 
mation, postulation of the pseudo- 
explanatory concept of “instinct” will 
be unnecessary. 


REFERENCES 


1. Bash, K. W. Contribution to a theory
of the hunger drive. J. comp. Psychol.,
1939, 28, 137-160.

2. ——. An investigation into a possible
organic basis for the hunger drive. J.
comp. Psychol., 1939, 28, 109-136.

3. Bindra, D. Water-hoarding in rats. J.
comp. physiol. Psychol., 1947, 40, 149-156.

4. ——. The nature of motivation for
hoarding food. J. comp. physiol. Psychol.,
1948, 41, 211-218.

5. ——. What makes rats hoard? J. comp.
physiol. Psychol., 1948, 41, 397-402.

6. Brogden, W. J. Non-alimentary components
in food-reinforcement. J. exp.
Psychol., 1942, 30, 326-335.

7. Calhoun, J. B. A method for self-control
of population growth among animals
living in the wild. Science, 1949, 109,
333-335.

8. Estes, W. K. A study of motivating
conditions necessary for secondary
reinforcement. J. exp. Psychol., 1949,
39, 306-310.

9. Farber, I. E. Response fixation under
anxiety and non-anxiety conditions. J.
exp. Psychol., 1948, 38, 111-131.

10. Flugel, J. C. “L’appetit vient en mangeant.”
Some reflections on the self-sustaining
tendencies. Brit. J. Psychol.,
1948, 38, 171-189.

11. Grice, G. R. The relation of secondary
reinforcement to delayed reward in
visual discrimination learning. J. exp.
Psychol., 1948, 38, 1-16.

12. Harlow, H. F. The formation of learning
sets. Psychol. Rev., 1949, 56,
51-65.

13. Hilgard, E. R. Theories of learning.
New York: Appleton-Century-Crofts,
1948.

14. Hull, C. L. Principles of behavior.
New York: Appleton-Century, 1943.

15. Hunt, J. McV. The effects of infant
feeding-frustration upon adult hoarding
in the albino rat. J. abn. soc. Psychol.,
1941, 36, 338-360.

16. ——, Schlosberg, H., Solomon, R. L., &
Stellar, E. Studies of the effects of
infantile experience on adult behavior
in rats. I. Effects of infantile feeding
frustration on adult hoarding. J. comp.
physiol. Psychol., 1947, 40, 291-304.

17. ——, & Willoughby, R. R. The effect
of frustration on hoarding in rats.
Psychosom. Med., 1939, 1, 311-312.

18. Long, J. A., & Evans, H. M. The oestrous
cycle in the rat and its associated
phenomena. Memoirs Univ. Calif., 1922,
6.

19. Marx, M. H. Experimental analysis
of the hoarding habit in the rat. I.
Preliminary observations. J. comp.
physiol. Psychol. (In press.)

20. May, M. A. Experimentally acquired
drives. J. exp. Psychol., 1948, 38,
66-77.

21. McCleary, R. A., & Morgan, C. T. Food-hoarding
in rats as a function of environmental
temperature. J. comp.
Psychol., 1946, 38, 371-378.

22. McCord, F. The effect of frustration on
hoarding in rats. J. comp. Psychol.,
1941, 32, 531-541.

23. McKelvey, R. K., & Marx, M. H. Effects
of infantile food and water deprivation
on adult hoarding in the rat.
(In preparation.)

. Mitter, G. A. Concerning the goal of 
hoarding behavior in the rat. J. comp. 
Psychol., 1945, 38, 209-212. 

——, & Postman, L. Individual and 
group hoarding in rats. Amer. J. Psy- 
chol., 1946, 59, 652-668. 


26. ——, & Viex, P. An analysis of the rat’s 


response to unfamiliar aspects of the 
hoarding situation. J. comp. Psychol., 
1944, 37, 221-231. 


. Mitter, N. E. Studies of fear as an ac- 


quirable drive: I. Fear as motivation 
and fear-reduction as reinforcement in 
the learning of new responses. J. exp. 
Psychol., 1948, 38, 89-101. 

Theory and experiment relating 
psychoanalytic displacement to stimulus- 
response genefalization. J. abn. soc. 
Psychol., 1948, 43, 155-178. 


. Morcan, C. T. Physiological psychology. 


New York: McGraw-Hill, 1943. 


.—. The hoarding instinct. PsycHot. 


Rev., 1947, 54, 335-341. 

, STELLAR, E., & Jounson, O. Food- 
deprivation and hoarding in rats. J. 
comp. Psychol., 1943, 35, 275-295. 


. Mowrer, O. H. A_ stimulus-response 


analysis of anxiety and its role as a re- 
inforcing agent. Psycuor. Rev., 1938, 
45, 62-91. 


. ——. Animal studies in the social modi- 


fication of organically determined beha- 
vior, 1939, Film. 

Anxiety-reduction and learning. J. 
exp. Psychol., 1940, 27, 497-516. 

, & LamoreAux, R. R. Fear as an 
intervening variable in avoidance con- 
ditioning. J. comp. Psychol., 1946, 39, 
29-50. 

, & Viex, P. An experimental ana- 
logue of fear from a sense of helpless- 
ness. J. abn. soc. Psychol., 1948, 43, 
193-200. 


. SattzMaNn, I. J. Maze learning in the 


absence of primary reinforcement: A 
study of secondary reinforcement. J. 
comp. physiol. Psychol., 1949, 42, 161- 

173. 

, & Kocu, S. The effect of low in- 
tensities of hunger on the behavior 
mediated by a habit of maximum 
strength. J. exp. Psychol., 1948, 38, 
347-370. 


39. Suirtey, Mary. Studies in activity. II: 


Activity rhythm; age and activity; ac- 
tivity after rest. J. comp. Psychol, 
1928, 8, 159-186. 


. Stecet, P. S. Activity level as a function 


of physically enforced inaction. J. 
Psychol., 1946, 21, 285-291. 


. Stonaker, J. R. Analysis of daily activity 


of the albino rat. Amer. J. Physiol., 
1925, 63, 485-503. 


. Sotomon, R. L. The influence of work 


on behavior. Psychol. Bull., 1948, 45, 
1-40. 





StrmuLus-RESPONSE ANALYSIS OF HOARDING HABIT IN THE RAT 93 


. Spence, K. W. The role of secondary 48. Wanc, G. H. The relation between spon- 
reinforcement in delayed reward learn- taneous activity and estrous cycle in the 
ing. Psycuor. Rev., 1947, 54, 1-8. white rat. Comp. Psychol. Monogr., 

. StectaR, E. The effect of epinephrine, 1923, 2, No. 6. 
insulin, and glucose upon hoarding in 49. Waters, R. H. Morgan’s canon and 
rats. J. comp. Psychol., 1943, 36, anthropomorphism. PsycHoL. Rev., 

1939, 46, 534-540. 


21-31. 
_—. Studies of the effects of the infan- 50. Wess, W. B. The motivational aspect of 
an irrelevant drive in the behavior of 


tile feeding frustration on the adult 4 h 
hoarding of the white rat. Unpublished ~ white rat. J. exp. Psychol., 1949, 
Master’s Thesis, Brown Univ., 1942. Phe = 1-14. 
Ses CH Wie wits a ee 51. Wotre, J. B. An exploratory study of 
: ee ge Te hee ey r food-storing in rats. J. comp. Psychol., 
perience and deprivation in the onset of 1939. 28. 97-108 
hoarding behavior in the rat. J. comp. 52. Seem P. T Food-secking pw 
' Psychol., 1943, 36, 47-55. fective process, and learning. PsycHOoL. 
e VIEK, Pus & MILLER, G. A. The cage aS a Rev 1949 56 98-121 
factor in hoarding. J. comp. Psychol., 7 alters ; 
1944, 37, 203-210. [MS. received July 8, 1949] 





TOWARD A STATISTICAL THEORY OF LEARNING * 


BY WILLIAM K. ESTES 


Indiana University 


Improved experimental techniques 
for the study of conditioning and simple 
discrimination learning enable the pres- 
ent day investigator to obtain data 
which are sufficiently orderly and re- 
producible to support exact quantita- 
tive predictions of behavior. Analogy 
with other sciences suggests that full 
utilization of these techniques in the 
analysis of learning processes will de- 
pend to some extent upon a comparable 
refinement of theoretical concepts and 
methods. The necessary interplay be- 
tween theory and experiment has been 
hindered, however, by the fact that 
none of the many current theories of 
learning commands general agreement 
among researchers. It seems likely that 
progress toward a common frame of 
reference will be slow so long as most 
theories are built around verbally de- 
fined hypothetical constructs which are 
not susceptible to unequivocal verifica- 
tion. While awaiting resolution of the 
many apparent disparities among com- 
peting theories, it may be advantageous 
to systematize well established empiri- 
cal relationships at a peripheral, statis- 
tical level of analysis. The possibility 
of agreement on a theoretical frame- 
work, at least in certain intensively 
studied areas, may be maximized by 
defining concepts in terms of experi- 
mentally manipulable variables, and 
developing the consequences of assump- 
tions by strict mathematical reasoning. 

This essay will introduce a series of
studies developing a statistical theory of
elementary learning processes. From
the definitions and assumptions which
appear necessary for this kind of formulation,
we shall attempt to derive
relations among commonly used measures
of behavior and quantitative expressions
describing various simple
learning phenomena.

* For continual reinforcement of his efforts
at theory construction, as well as for many
specific criticisms and suggestions, the writer
is indebted to his colleagues at Indiana University,
especially Cletus J. Burke, Douglas
G. Ellson, Norman Guttman, and William S.
Verplanck.


PRELIMINARY CONSIDERATIONS 


Since propositions concerning psy- 
chological events are verifiable only 
to the extent that they are reducible to 
predictions of behavior under specified 
environmental conditions, it appears 
likely that greatest economy and con- 
sistency in theoretical structure will 
result from the statement of all funda- 
mental laws in the form 


R = f(S),


where R and S represent behavioral 
and environmental variables respectively. 
Response-inferred laws, as for example 
those of differential psychology, should 
be derivable from relationships of this 
form. The reasoning underlying this 
position has been developed in a recent 
paper by Spence (8). Although devel- 
oped within this general framework, the 
present formulation departs to some ex- 
tent from traditional definitions of S 
and R variables. 

Many apparent differences among 
contemporary learning theories seem to 
be due in part to an oversimplified defi- 
nition of stimulus and response. The 
view of stimulus and response as ele- 
mentary, reproducible units has always 
had considerable appeal because of its 
simplicity. This simplicity is deceptive, 
however, since it entails the postulation 
of various hypothetical processes to account
for observed variability in behavior.
In the present formulation, we
shall follow the alternative approach of
including the notion of variability in
the definitions of stimulus and response,
and investigating the theoretical consequences
of these definitions.

It will also be necessary to modify 
the traditional practice of stating laws 
of learning in terms of relations between 
isolated stimuli and responses. At- 
tempts at a quantitative description of 
learning and extinction of operant behav- 
ior have led the writer to believe that 
a self-consistent theory based upon the 
classical S-R model may be difficult, if 
not impossible, to extend over any very 
wide range of learning phenomena without
the continual addition of ad hoc
hypotheses to handle every new situ- 
ation. A recurrent difficulty might be 
described as follows. In most formula- 
tions of simple learning, the organism 
is said originally to “do nothing” in 
the presence of some stimulus; during 
learning, the organism comes to make 
some predesignated response in the pres- 
ence of the stimulus; then during ex- 
tinction, the response gradually gives 
way to a state of “not responding” 
again. But this type of formulation 
does not define a closed or conservative 
system in any sense. In order to derive 
properties of conditioning and extinc- 
tion from the same set of general laws, 
it is necessary to assign specific proper- 
ties to the state of not responding 
which is the alternative to occurrence 
of the designated response. One solu- 
tion is to assign properties as needed 
by special hypotheses, as has been done, 
for example, in the Pavlovian concep- 
tion of inhibition. In the interest of 
simplicity of theoretical structure, we 
shall avoid this procedure so far as 
possible. 

The role of competing reactions has 
been emphasized by some writers, but 
usually neglected in formal theorizing. 


The point of view to be developed here 
will adopt as a standard conceptual 
model a closed system of behavioral and 
environmental variables. In any spe- 
cific behavior-system, the environmental 
component may include either the en- 
tire population of stimuli available in 
the situation or some specified portion 
of that population. The behavioral 
component will consist in mutually ex- 
clusive classes of responses, defined in 
terms of objective criteria; these classes 
will be exhaustive in the sense that they 
will include all behaviors which may 
be evoked by that stimulus situation. 
Given the initial probabilities of the 
various responses available to an organ- 
ism in a given situation, we shall expect 
the laws of the theory to enable predic- 
tions of changes in those probabilities 
as a function of changes in values of 
independent variables. 


DEFINITIONS AND ASSUMPTIONS 


1. R-variables. It will be assumed 
that any movement or sequence of 
movements may be analyzed out of an 
organism’s repertory of behavior and 
treated as a “response,” various properties
of which can be treated as dependent
variables subject to all the laws
of the theory. (Hereafter we shall ab- 
breviate the word response as R, with 
appropriate subscripts where neces- 
sary.) In order to avoid a common 
source of confusion, it will be necessary 
to make a clear distinction between the 
terms R-class and R-occurrence. 

The term R-class will always refer 
to a class of behaviors which produce 
environmental effects within a specified 
range of values. This definition is not 
without objection (cf. 4) but has the 
advantage of following the actual prac- 
tice of most experimenters. It may be 
possible eventually to coordinate R- 
classes defined in terms of environ- 
mental effects with R-classes defined in 
terms of effector activities. 







By R-occurrence we shall mean 
a particular, unrepeatable behavioral 
event. All occurrences which meet the 
defining criteria of an R-class are 
counted as instances of that class, and 
as such are experimentally interchange- 
able. In fact, various instances of an 
R-class are ordinarily indistinguishable 
in the record of an experiment even 
though they may actually vary with 
respect to properties which are not 
picked up by the recording mechanism. 

Indices of tendency to respond, e.g., 
probability as defined below, always 
refer to R-classes. 

These distinctions may be clarified 
by an illustration. In the Skinner-type 
conditioning apparatus, bar-pressing is 
usually treated as an R-class. Any 
movement of the organism which re- 
sults in sufficient depression of the bar 
to actuate the recording mechanism is 
counted as an instance of the class. 


The R-class may be subdivided into
finer classes by the same kind of criteria.
We could, if desired, treat depression
of a bar by the rat’s right
forepaw and depression of the bar by 
the left forepaw as instances of two 
different classes provided that we have 
a recording mechanism which will be 
affected differently by the two kinds of 
movements and mediate different rela- 
tions to stimulus input (as for example 
the presentation of discriminative stim- 
uli or reinforcing stimuli). If proba- 
bility is increased by reinforcement, 
then reinforcement of a right-forepaw- 
bar-depression will increase the proba- 
bility that instances of that subclass 
will occur, and will also increase the 
probability that instances of the broader 
class, bar-pressing, will occur. 

2. S-variables. For analytic pur- 
poses it is assumed that all behavior 
is conditional upon appropriate stimu- 
lation. It is not implied, however, that 
responses can be predicted only when 
eliciting stimuli can be identified. According
to the present point of view,
laws of learning enable predictions of 
changes in probability of response as a 
function of time under given environ- 
mental conditions. 

A stimulus, or stimulating situation, 
will be regarded as a finite population 
of relatively small, independent, en- 
vironmental events, of which only a 
sample is effective at any given time. 
In the following sections we shall desig- 
nate the total number of elements as- 
sociated with a given source of stimula- 
tion as S (with appropriate subscripts 
where more than one source of stimu- 
lation must be considered in an experi- 
ment), and the number of elements ef- 
fective at any given time as s. It is 
assumed that when experimental condi- 
tions involve the repeated stimulation 
of an organism by the “same stimulus,” 
that is by successive samples of ele- 
ments from an S-population, each sam- 
ple may be treated as an independent 
random sample from S. It is to be ex- 
pected that sample size will fluctuate 
somewhat from one moment to the next, 
in which case s will be treated as the 
average number of elements per sample 
over a given period. 

In applying the theory, any portion 
of the environment to which the or- 
ganism is exposed under uniform condi- 
tions may be considered an S-popula- 
tion. The number of different S’s said 
to be present in a situation will depend 
upon the number of independent ex- 
perimental operations, and the degree of 
specificity with which predictions of 
behavior are to be made. If the experi- 
menter attempts to hold the stimulating 
situation constant during the course of 
an experiment, then the entire situa- 
tion will be treated as a single S. If 
in a conditioning experiment, a light 
and shock are to be independently ma- 
nipulated as the CS and US, then each 
of these sources of stimulation will be
treated as a separate S-population, and
so on. 

It should be emphasized that the 
division of environment and behavior 
into elements is merely an analytic 
device adopted to enable the applica- 
tion of the finite-frequency theory of 
probability to behavioral phenomena. 
In applying the theory to learning ex- 
periments we shall expect to evaluate 
the ratio s/S for any specific situa- 
tion from experimental evidence, but 
for the present at least no operational 
meaning can be given to a numerical 
value for either S or s taken separately. 

3. Probability of response. Proba- 
bility will be operationally defined as 
the average frequency of occurrence of 
instances of an R-class relative to the 
maximum possible frequency, under a 
specified set of experimental conditions, 
over a period of time during which the 
conditions remain constant. In accord- 
ance with customary usage the term 
probability, although defined as a rela- 
tive frequency, will also be used to ex- 
press the likelihood that a response will 
occur at a given time. 

4. Conditional relation. This relation 
may obtain between an R-class and any 
number of the elements in an S-popula- 
tion, and has the following implications.

(a) If a set of x elements from an S
are conditioned to (i.e., have the conditional
relation to) some R-class, R_1,
at a given time, the probability that the
next response to occur will be an instance
of R_1 is x/S.

(b) If at a given time in an S-population,
x_1 elements are conditioned
to some R-class, R_1, and x_2 elements are
conditioned to another class, R_2, then
x_1 and x_2 have no common elements.

(c) If all behaviors which may be 
evoked from an organism in a given 
situation have been categorized into 
mutually exclusive classes, then the 
probabilities attaching to the various 
classes must sum to unity at all times. 


We consider the organism to be always 
“doing something.” If any arbitrarily 
defined class of activities may be se- 
lected as the dependent variable of a 
given experiment, it follows that the 
activity of the organism at any time 
must be considered as subject to the 
same laws as the class under considera- 
tion. Any increase in probability of 
one R-class during learning will, then, 
necessarily involve the reduction in 
probability of other classes; similarly, 
while the probability of one R de- 
creases during extinction, the probabili- 
ties of others must increase. In other 
words, learning and unlearning will be 
considered as transfers of probability 
relations between R-classes. 

5. Conditioning. It is assumed that
on each occurrence of a response, R_1,
all new elements (i.e., elements not already
conditioned to R_1) in the momentarily
effective sample of stimulus
elements, s, become conditioned to R_1.

An important implication of these 
definitions is that the conditioning of a 
stimulus element to one R automatically 
involves the breaking of any pre-existing 
conditional relations with other R’s. 
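The conditioning assumption can be illustrated by a small simulation. The following is only a sketch under stated assumptions: the stimulus population is represented as the integers 0 to S - 1, each effective sample is drawn at random without replacement, and the response occurs on every trial. The function name and the parameter values are hypothetical and are not part of the theory.

```python
import random

def simulate_conditioning(S=100, s=10, x0=0, trials=50, seed=0):
    """Count elements conditioned to R when R occurs on every trial.

    S, s, x0 follow the text's notation; the values are illustrative only.
    """
    rng = random.Random(seed)
    conditioned = set(range(x0))           # elements already conditioned to R
    history = [len(conditioned)]
    for _ in range(trials):
        sample = rng.sample(range(S), s)   # momentarily effective sample
        conditioned.update(sample)         # every new element in the sample is conditioned
        history.append(len(conditioned))
    return history

counts = simulate_conditioning()
print(counts[:5], counts[-1])
```

The count can never decrease and never exceeds S, since elements already conditioned to R contribute nothing new when they are sampled again.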

6. Motivation. Experimental opera- 
tions which in the usual terminology 
are said to produce motives (e.g., food- 
deprivation) may affect either the com- 
position of an S or the magnitude of 
the s/S ratio. Detailed discussion of 
these relations is beyond the scope of 
the present paper. In all derivations 
presented here we shall assume motivat- 
ing conditions constant throughout an 
experiment. 

7. Reinforcement. This term will be 
applied to any experimental condition 
which ensures that successive occur- 
rences of a given R will each be con- 
tiguous with a new random sample of 
elements from some specified S-popula- 
tion. Various ways of realizing this 
definition experimentally will be dis- 
cussed in the following sections. 







SIMPLE CONDITIONING: REINFORCEMENT BY CONTROLLED ELICITATION


Let us consider first the simplest type
of conditioning experiment. The system
to be described consists of a sub-population
of stimulus elements, S_c,
which may be manipulated independently
of the remainder of the situation,
S, and a class, R, of behaviors defined
by certain measurable properties. By
means of a controlled original stimulus,
that is, one which has initially a high
probability of evoking R, it is ensured
that an instance of R will occur on
every trial contiguously with the sample
of stimulus elements which is present.
In the familiar buzz-shock conditioning
experiment, for example, S_c
would represent the population of stimulus
elements emanating from the sound
source and R would include all movements
of a limb meeting certain specifications
of direction and amplitude;
typically, the R to be conditioned is a
flexion response which may be evoked
on each training trial by administration
of an electric shock.

Designating the mean number of elements
from S_c effective on any one trial
as s_c, and the number of elements from
S_c which are conditioned to R at any
time as x, the expected number of new
elements conditioned on any trial will
be

$$\Delta x = \frac{s_c (S_c - x)}{S_c}. \tag{1}$$

If the change in x per trial is relatively
small, and the process is assumed
continuous, the right hand portion of
(1) may be taken as the average rate
of change of x with respect to number
of trials, T, at any moment, giving

$$\frac{dx}{dT} = \frac{s_c (S_c - x)}{S_c}. \tag{2}$$

This differential equation may be integrated
to yield

$$x = S_c - (S_c - x_0) e^{-qT}, \tag{3}$$

where x_0 is the initial value of x, and q
represents the ratio s_c/S_c. Thus x will
increase from its initial value to approach
the limiting value, S_c, in a negatively
accelerated curve. A method of
evaluating x in these equations from
empirical measures of response latency,
or reaction time, will be developed in a
later section.

If the remainder of the situation has
been experimentally neutralized, the
probability of R in the presence of a
sample from S_c will be given by the
ratio x/S_c. Representing this ratio by
the single letter p, and its initial value
x_0/S_c by p_0, and making appropriate
substitutions in (3), we have the
following expression for probability of
R as a function of the number of reinforced
trials.

$$p = 1 - (1 - p_0) e^{-qT}. \tag{3'}$$
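The relation between the per-trial increment and the integrated curve can be checked numerically. The sketch below assumes the increment per trial is s_c(S_c - x)/S_c and the integrated form is x = S_c - (S_c - x_0)e^{-qT} with q = s_c/S_c; the function names and the parameter values used in the comparison are hypothetical.

```python
import math

def x_closed_form(T, S_c, s_c, x0):
    """Integrated curve: x = S_c - (S_c - x0) * exp(-q*T), with q = s_c / S_c."""
    q = s_c / S_c
    return S_c - (S_c - x0) * math.exp(-q * T)

def x_by_iteration(T, S_c, s_c, x0):
    """Apply the expected per-trial increment s_c * (S_c - x) / S_c, trial by trial."""
    x = float(x0)
    for _ in range(T):
        x += s_c * (S_c - x) / S_c
    return x

def p_closed_form(T, q, p0):
    """Probability of R after T reinforced trials: p = 1 - (1 - p0) * exp(-q*T)."""
    return 1.0 - (1.0 - p0) * math.exp(-q * T)

# With a small ratio q the discrete and continuous treatments nearly coincide.
print(x_by_iteration(50, 100, 1, 0), x_closed_form(50, 100, 1, 0))
```

The agreement is close only when the change in x per trial is small, which is the condition under which the continuous approximation was introduced.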


Since we have not assumed any spe- 
cial properties for the original (or un- 
conditioned) stimulus other than that 
of regularly evoking the response to be 
conditioned, it is to be expected that 
the equations developed in this section 
will describe the accumulation of con- 
ditional relations in other situations 
than classical conditioning, provided 
that other experimental operations func- 
tion to ensure that the response to be 
learned will occur in the presence of 
every sample drawn from the S-popula- 
tion. 


OPERANT CONDITIONING: REINFORCEMENT BY CONTINGENT STIMULATION


In the more common type of experimental
arrangement, variously termed
operant, instrumental, trial and error, 
etc. by different investigators, the re- 
sponse to be learned is not elicited by 
a controlled original stimulus, but has 
some initial strength in the experimental
situation and occurs originally as part
of so-called “random activity.” Here 
the response cannot be evoked concur- 
rently with the presentation of each new 
stimulus sample, but some of the same 
effects can be secured by making 
changes in the stimulating situation 
contingent upon occurrences of the re- 
sponse. Let us consider a situation of 
this sort, assuming that the activities of 
the organism have been catalogued and 
classified into two categories, all move- 
ment sequences characterized by a cer- 
tain set of properties being assigned to 
class R and all others to the class R_e,
and that members of class R are to be 
learned. 

If changes in the stimulus sample are
independent of the organism’s behavior,
we should expect instances of the two
response classes to occur, on the average,
at rates proportional to their initial
probabilities. For if x elements from
the S-population are originally conditioned
to R, then the probability of R
will be x/S; the number of new elements
conditioned to R if an instance
occurs will be s[(S - x)/S], s again
representing the number of stimulus
elements in a sample; and the mathematically
expected increase in x will
be the product of these quantities,
sx[(S - x)/S^2]. At the same time,
the probability of R_e will be (S - x)/S,
and the number of new elements conditioned
to R_e if an instance occurs will
be sx/S; multiplying these quantities,
we have sx[(S - x)/S^2] as the mathematically
expected decrease in x. Thus
we should predict no average change
in x under these conditions.
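The cancellation argued in this paragraph can be verified with exact arithmetic. The sketch below assumes only the four quantities named in the text (the two probabilities and the two expected transfers of elements); the function name and the values of S and s are hypothetical.

```python
from fractions import Fraction

def expected_change(x, S, s):
    """Expected net change in x when sampling is independent of behavior."""
    p_R = Fraction(x, S)                     # probability of class R
    p_other = Fraction(S - x, S)             # probability of the alternative class
    gain_if_R = Fraction(s * (S - x), S)     # new elements conditioned to R
    loss_if_other = Fraction(s * x, S)       # elements taken over by the alternative class
    return p_R * gain_if_R - p_other * loss_if_other

# The expected gain and the expected loss are equal for every value of x.
print({x: expected_change(x, 100, 10) for x in (0, 25, 50, 100)})
```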

In the acquisition phase of a learning
experiment two important restrictions
imposed by the experimenter tend
to force a correlation between changes 
in the stimulus sample and occurrences 
of R. The organism is usually intro- 
duced into the experimental situation 
at the beginning of a trial, and the
trial lasts until the pre-designated response,
R, occurs. For example, in a
common discrimination apparatus the 
animal is placed on a jumping stand at 
the beginning of each trial and the trial 
continues until the animal leaves the 
stand; a trial in a runway experiment 
lasts until the animal reaches the end 
box, and so on. Typically the stimulat- 
ing situation present at the beginning 
of a trial is radically changed, if not 
completely terminated, by the occur- 
rence of the response in question; and a 
new trial begins under the same condi- 
tions, except for sampling variations, 
after some pre-designated interval. The 
pattern of movement-produced stimuli 
present during a trial may be changed 
after occurrences of R by the evocation 
of some uniform bit of behavior such as 
eating or drinking; in some cases the 
behavior utilized for this purpose must 
be established by special training prior 
to a learning experiment. In the
Skinner box, for example, the animal is 
trained to respond to the sound of the 
magazine by approaching it and eating 
or drinking. Then when operation of 
the magazine follows the occurrence of 
a bar-pressing response during condi- 
tioning of the latter, the animal’s re- 
sponse to the magazine will remove it 
from the stimuli in the vicinity of the 
bar and ensure that for an interval of 
time thereafter the animal will not be 
exposed to most of the S-population; 
therefore the sample of elements to 
which the animal will next respond may 
be considered very nearly a new random 
sample from S. 

In the simplest operant conditioning 
experiments it may be possible to 
change almost the entire stimulus sam- 
ple after each occurrence of R (com- 
plete reinforcement), while in other 
cases the sampling of only some re- 
stricted portion of the S-population is 
correlated with R (partial reinforcement).
We shall consider the former
case in some detail in the remainder of
this section. 

By our definition of the conditional 
relation, we shall expect all R-classes 
from which instances actually occur on 
any trial to be conditioned to stim- 
ulus elements present on that trial. 
The first movement to occur will be 
conditioned to the environmental cues 
present at the beginning of the trial; 
the next movement will be conditioned
to some external cues, if the
situation is not completely constant 
during a trial, and to proprioceptive 
cues from the first movement, and so on, 
until the predesignated response, R, 
occurs and terminates the trial. If 
complete constancy of the stimulating 
situation could be maintained, the most 
probable course of events on the next 
trial would be the recurrence of the 
same sequence of movements. In prac- 
tice, however, the sample of effective 
stimulus elements will change somewhat 
in composition, and some responses
which occur on one trial may fail to oc- 
cur on the next. The only response 
which may never be omitted is R, since 
the trial continues until R occurs. This 
argument has been developed in greater 
detail by Guthrie (4). In order to 
verify the line of reasoning involved, 
we need now to set these ideas down in 
mathematical form and investigate the 
possibility of deriving functions which 
will describe empirical curves of learn- 
ing. 

Since each trial lasts until R occurs, 
we need an expression for the probable 
duration of a trial in terms of the 
strength of R. Suppose that we have 
categorized all movement sequences 
which are to be counted as “responses” 
in a given situation, and that the mini- 
mum time needed for completion of a 
response-occurrence is, on the average, h. 
For convenience in the following devel- 
opment, we shall assume that the mean 
duration of instances of class R is approximately
equal to that of class R_e.
Let the total number of stimulus ele- 
ments available in the experimental 
situation be represented by S, the sam- 
ple effective on any one trial by s, and 
the ratio s/S by q. The probability, p,
of class R at the beginning of any trial
will have the value x/S; if this value
varies little within a trial, we can readily
compute the probable number of responses
(of all classes) that will occur
before the trial is terminated. The
probability that an instance of R will
be the first response to occur on the
trial in question is p; the probability
that it will be the second is p(1 - p);
the probability that it will be the third
is p(1 - p)^2; etc. If we imagine an indefinitely
large number of trials run under
identical conditions, and represent
the number of response occurrences on
any trial by n, we may weight each possible
value of n by its probability (i.e.,
expected relative frequency) and obtain
a mean expected value of n. In symbolic
notation we have

$$\bar{n} = \sum_{n=1}^{\infty} n p (1 - p)^{n-1} = p \sum_{n=1}^{\infty} n (1 - p)^{n-1}.$$

The expression inside the summation
sign will be recognized as the general
term of a well-known infinite series with
the sum 1/(1 - (1 - p))^2. Then we
have, by substitution,

$$\bar{n} = \frac{p}{(1 - (1 - p))^2} = \frac{1}{p}.$$

Then L, the average time per trial, will
be the product of the expected number
of responses and the mean time per
response.

$$L = \bar{n} h = \frac{h}{p} = \frac{Sh}{x}.$$
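The summation can be checked by direct computation. The sketch below assumes that the probability that R is the n-th response on a trial is p(1 - p)^(n-1), and truncates the infinite series at a large number of terms; the function names, the truncation point, and the numerical values are hypothetical.

```python
def mean_responses_per_trial(p, terms=10_000):
    """Truncated sum of n * p * (1 - p)**(n - 1); approaches 1/p for 0 < p <= 1."""
    return sum(n * p * (1 - p) ** (n - 1) for n in range(1, terms + 1))

def mean_trial_duration(x, S, h):
    """Average time per trial, L = S*h/x, where h is the mean time per response."""
    return S * h / x

# With p = 0.25 the series converges on 4 responses per trial.
print(mean_responses_per_trial(0.25), mean_trial_duration(25, 100, 2.0))
```

When one quarter of the elements are conditioned to R and each response takes two time units on the average, the expected duration of a trial is therefore eight units.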


Since R will be conditioned to all new
stimulus elements present on each trial,
we may substitute for x its equivalent
from equation (3), dropping the subscripts
from S_c and s_c, and obtaining

$$L = \frac{Sh}{S - (S - x_0) e^{-qT}} = \frac{h}{1 - \frac{L_0 - h}{L_0} e^{-qT}}. \tag{4}$$

Thus, L will decline from an initial
value of L_0 (equal to Sh/x_0) and approach
the asymptotic minimum value
h over a series of trials.
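The behavior of the latency function at its two extremes may be examined numerically. The sketch below assumes the form L = h / (1 - ((L_0 - h)/L_0) e^{-qT}), which follows from L = Sh/x together with L_0 = Sh/x_0; the function name and the parameter values are hypothetical and are not estimates from any experiment.

```python
import math

def latency(T, L0, h, q):
    """Expected time per trial after T reinforced trials."""
    return h / (1.0 - ((L0 - h) / L0) * math.exp(-q * T))

# Illustrative values only: initial latency 60, asymptote 2, q = 0.1.
for T in (0, 5, 10, 20, 40):
    print(T, round(latency(T, 60.0, 2.0, 0.1), 2))
```

At T = 0 the function returns L_0, and for large T it approaches h, as the text requires.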

A preliminary test of the validity of 
this development may be obtained by 
applying equation (4) to learning data 
from a runway experiment in which the 
conditions assumed in the derivation are 
realized to a fair degree of approxima- 
tion. In Fig. 1 we have plotted acquisi- 
tion data reported by Graham and 
Gagné (3). Each empirical point rep- 
resents the geometric mean latency for 
a group of 21 rats which were reinforced
with food for traversing a simple
elevated runway. The theoretical curve
in the figure represents the equation

$$L = \frac{2.5}{1 - .9648 e^{-.12T}},$$

where values of L_0, h, and q have been
estimated from the data. This curve
appears to give a satisfactory graduation
of the obtained points and, it might
be noted, is very similar in form to the
theoretical acquisition curve developed
by Graham and Gagné. The present
formulation differs from theirs chiefly
in including the time of the first response
as an integral part of the learning
process. The quantitative description
of extinction in this situation will
be presented in a forthcoming paper.

[Fig. 1. Latencies of a runway response
during conditioning, obtained from published
data of Graham and Gagné (3), are fitted
by a theoretical curve derived in the text.
Ordinate: geometric mean latency; abscissa:
trial (T + 1).]

In order to apply the present theory 
to experimental situations such as the 
Skinner box, in which the learning pe- 
riod is not divided into discrete trials, 
we shall have to assume that the in- 
tervals between reinforcements in those 
situations may be treated as “trials”
for analytical purposes. Making this 
assumption, we may derive an expres- 
sion for rate of change of conditioned 
response strength as a function of time 
in the experimental situation, during a 
period in which all responses of class R 
are reinforced. 

L, as defined above, will represent the
time between any two occurrences of R.
Then if we let t represent time elapsed
from the beginning of the learning period
to a given occurrence of R, and T
the number of occurrences (and therefore
reinforcements) of R, we have from
the preceding development

$$L = \frac{Sh}{x}.$$

Since L may be considered as the increment
in time during a trial, so that
ΔT/Δt = 1/L, we can write the identity

$$\frac{\Delta x}{\Delta t} = \frac{\Delta x}{\Delta T} \cdot \frac{\Delta T}{\Delta t}.$$

Substituting for Δx/ΔT its equivalent
from (1), without subscripts, and for
ΔT/Δt its equivalent, x/Sh, from the
preceding equation for L, we have

$$\frac{\Delta x}{\Delta t} = \frac{s (S - x) x}{h S^2}. \tag{5}$$


If the change in x per reinforcement is 
small and the process is assumed con- 
tinuous, the right hand portion of equa- 
tion (5) may be taken as the value of 
the derivative dx/d¢ and integrated 
with respect to time— 


where B = s/Sh. In general, this equa- 
tion defines a logistic curve with the 
amount of initial acceleration depend- 
ing upon the value of x,. Curves of 
probability (x/S) vs. time for S = 100, 
B= 0.25, and several different values 
of x, are illustrated in Fig. 2. 
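The logistic form can be checked numerically. The sketch below, in Python, compares the closed-form curve x = S/(1 + ((S - x_0)/x_0)e^(-Bt)) with a simple Euler integration of dx/dt = Bx(S - x)/S, using S = 100 and B = 0.25 as in the illustrative curves. The function names, the step size, and the three initial values are assumptions chosen here for illustration.

```python
import math

def logistic_x(t, x0, S=100.0, B=0.25):
    """Closed-form solution of dx/dt = B*x*(S - x)/S with x(0) = x0."""
    return S / (1.0 + (S - x0) / x0 * math.exp(-B * t))

def euler_x(t_end, x0, S=100.0, B=0.25, dt=0.001):
    """Step dx/dt = B*x*(S - x)/S forward with Euler's method (check only)."""
    x = x0
    for _ in range(int(round(t_end / dt))):
        x += dt * B * x * (S - x) / S
    return x

if __name__ == "__main__":
    # Hypothetical initial values; probability is x/S.
    for x0 in (1.0, 10.0, 50.0):
        row = ", ".join(f"{logistic_x(t, x0) / 100.0:.3f}" for t in (0, 10, 20, 30))
        print(f"x0 = {x0:4.0f}: p(t) at t = 0, 10, 20, 30 -> {row}")
```

Smaller initial values give a longer positively accelerated limb before the curve turns over, which is the dependence on x_0 noted in the text.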




Since we are considering a situation
in which a reinforcement is administered
(or a new “trial” is begun) after each
occurrence of R, we are now in a position
to express the expected rate of
occurrence of R as a function of time.
Representing rate of occurrence of R
by r = dR/dt, and the ratio 1/h by w,
we have

$$r = \frac{wx}{S} = \frac{w}{1 + \dfrac{S - x_0}{x_0}\,e^{-Bt}},$$

and if we take the rate of R at the beginning
of the experimental period as
r_0 = wx_0/S this relation becomes

$$r = \frac{w}{1 + \dfrac{w - r_0}{r_0}\,e^{-Bt}}. \tag{7}$$





Fig. 2. Illustrative curves of probability vs. time during conditioning; parameters of the curves
are the same except for the initial x-values. [Abscissa: time.]

Fig. 3. Number of responses per minute during conditioning of a bar-pressing habit in a
single rat; the theoretical curve is derived in the text. [Abscissa: time in minutes.]

To illustrate this function, we have
plotted in Fig. 3 measures of rate of
responding during conditioning of a bar- 
pressing response by a single rat. The 
apparatus was a Skinner box; motiva- 
tion was 24 hours thirst; the animal had 
previously been trained to drink out of 
the magazine, and during the period il- 
lustrated was reinforced with water for 
all bar-pressing responses. Measures 
of rate at various times were obtained 
by counting the number of responses 
made during the half-minute before 
and the half-minute after the point 
being considered, and taking that value 
as an estimate of the rate in terms of 
responses per minute at the midpoint. 
The theoretical curve in the figure represents
the equation

$$r = \frac{13}{1 + 25e^{-.24t}}.$$

A considerable part of the variability 
of the empirical points in the figure is 
due to the inaccuracy of the method 
of estimating rates. In order to avoid 
this loss of precision, the writer has 
adopted the practice of using cumulative
curves of responses vs. time for
most purposes, and fitting the cumulative
records with the integral of equation (7):

$$R = wt + \frac{w}{B}\log_e\left[\frac{r_0}{w} + \left(1 - \frac{r_0}{w}\right)e^{-Bt}\right], \tag{8}$$


where R represents the number of re- 
sponses made after any interval of time, 
t, from the beginning of the learning 
period. The original record of re- 
sponses vs. time, from which the data 
of Fig. 3 were obtained, is reproduced 
in Fig. 4. Integration of the rate equa- 
tion for this animal yields 


$$R = 13t + 125\log_{10}\left(.038 + .962e^{-.24t}\right).$$


Magnitudes of R computed from this
equation for several values of t have
been plotted in Fig. 4 to indicate the 
goodness of fit; the theoretical curve has 
not been drawn in the figure since it 
would completely obscure most of the 
empirical record. In an experimental 
report now in press (2), equation (8) 
is fitted to several mean conditioning 
curves for groups of four rats; in all 
cases, the theoretical curve accounts for
more than 99 per cent of the variance
of the observed R values. Further
verification of the present formulation
has been derived from that study by
comparing the acquisition curves of successively
learned bar-pressing habits,
obtained in a Skinner-type conditioning
apparatus which included two bars
differing only in position. It has been
found that the parameters w and s/S
can be evaluated from the conditioning
curve of one bar response, and then
used to predict the detailed course of
conditioning of a second learned response.

Fig. 4. Reproduction of the original cumulative
record from which the points of Fig.
3 were obtained. Solid circles are computed
from an equation given in the text.
[Ordinate: responses. Abscissa: time in minutes.]
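The relation between the rate curve and the cumulative record for the single animal can be checked numerically. The Python sketch below evaluates the rate function r = w/(1 + ae^(-Bt)), its closed-form integral, a trapezoidal estimate of the same integral, and the rounded fitted equation R = 13t + 125 log10(.038 + .962e^(-Bt)). The values w = 13 and a = 25 are read from the fitted rate equation; B = .24 is taken here as an assumption consistent with the coefficient 125, and all function names are introduced only for this illustration.

```python
import math

W, A, B = 13.0, 25.0, 0.24  # asymptotic rate w, (w - r0)/r0, rate constant B (assumed)

def rate(t):
    """Responses per minute at time t (minutes): w / (1 + a*exp(-B*t))."""
    return W / (1.0 + A * math.exp(-B * t))

def cumulative(t):
    """Closed-form integral of rate() from 0 to t (natural logarithm)."""
    r0_over_w = 1.0 / (1.0 + A)
    return W * t + (W / B) * math.log(r0_over_w + (1.0 - r0_over_w) * math.exp(-B * t))

def cumulative_numeric(t, n=20000):
    """Trapezoidal estimate of the same integral, used only as a check."""
    dt = t / n
    total = 0.5 * (rate(0.0) + rate(t))
    total += sum(rate(i * dt) for i in range(1, n))
    return total * dt

def cumulative_rounded(t):
    """The fitted equation with rounded constants and a base-10 logarithm."""
    return 13.0 * t + 125.0 * math.log10(0.038 + 0.962 * math.exp(-B * t))

if __name__ == "__main__":
    for t in (5, 10, 20, 30):
        print(t, round(cumulative(t), 2), round(cumulative_numeric(t), 2),
              round(cumulative_rounded(t), 2))
```

Because the fitted constants are rounded, the rounded equation and the exact integral should be expected to agree only to within about one response over this range.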

The overall accuracy of these equa- 
tions in describing the rate of condi- 
tioning of bar-pressing and runway re- 
sponses should not be allowed to obscure 
the fact that a small but systematic 
error is present in the initial portion 
of most of the curves. It is believed 
that these disparities are due to the fact 
that experimental conditions do not 
usually fully realize the assumption that 
only one R-class receives any reinforce- 
ment during the learning period. A 
more general formulation of the theory, 
which does not require this assumption, 
will be discussed in the next section. 


PARTIAL REINFORCEMENT 


It can be shown that a given response 
may be “learned” in a trial and error 
situation provided that some sub-population
of stimulus elements is so controlled
by experimental conditions that
each sample of elements drawn from 
it is contiguous with an occurrence of 
the response. The sort of derivation 
needed to handle this kind of partial 
reinforcement will be sketched briefly 
in this section. A more detailed treat- 
ment will be given, together with rele- 
vant experimental evidence, in a paper 
now in preparation. It should be em- 
phasized that we are using the term 
“partial” to refer to incomplete change 
of the stimulus sample on each occur- 
rence of a given response, and not to 
periodic, or intermittent reinforcement. 

Consider a behavior system involving
two classes of competing behaviors,
R and R_e, which may occur in a situation,
S, composed of two independently
manipulable sub-populations, S_r and S_e.
Experimental conditions are to ensure
that of the sample, s, of elements stimulating
the organism at any time, elements
from S_r remain effective until
terminated by the occurrence of R,
while elements from S_e remain effective
until terminated by the occurrence of
R_e. This kind of system might be illustrated
by a Skinner box in which the
entire stimulus sample is not terminated
by occurrence of the bar-pressing response;
for example, if the box is illuminated,
the visual stimulation will
be relatively unaffected by bar-pressing
but will be terminated if the animal
closes its eyes (the latter behavior
being, then, an instance of R_e).

Let x represent the total number of
elements from S conditioned to R at a
given time, x_r the number of elements
from S_r conditioned to R, T_r and T_e
the numbers of occurrences of R and
R_e prior to the time in question, and q
the ratio s/S. By reasoning similar to
that utilized in deriving equations (2)
and (5), we may obtain for the average
rate of change of x_r with respect to
T_r at any time

$$\frac{dx_r}{dT_r} = q(S_r - x_r). \tag{9}$$

This may be integrated to yield

$$x_r = S_r - (S_r - x_{r0})e^{-qT_r}, \tag{10}$$

which is identical in form with equation
(2).

The other component of x, (x - x_r),
will decrease as these elements become
conditioned to the competing response
class, R_e, according to the following
relations:

$$\frac{d(x - x_r)}{dT_e} = -\frac{s(S - S_r)}{S}\cdot\frac{(x - x_r)}{(S - S_r)} = -q(x - x_r), \tag{11}$$

and the integral,

$$x - x_r = (x_0 - x_{r0})e^{-qT_e}. \tag{12}$$

It will be observed that an analogous 
set of equations could be written for 
changes in the number of elements conditioned
to R_e, and that the argument
could be extended to any number of 
mutually exclusive classes of responses. 
From these relations it is not difficult 
to deduce differential equations which 
may be at least numerically integrated 
to yield curves giving probability of 
occurrence of each response class as a 
function of number of reinforcements. 
We shall not carry out the derivations 
here, but shall point out a number of 
properties of the curves obtained which 
will be evident from inspection of equa- 
tions (10) and (12). 

1. Regardless of the initial probabilities,
the behavior system will tend to
a state of equilibrium in which the final
mean probability of R will be S_r/S
and the final mean probability of R_e will
be (S - S_r)/S.

2. If the number of elements from S
conditioned to R at the start of an
experiment is greater than S_r, the probability
of R will decrease until the
equilibrium is reached. (Of course all
statements made here about R have
analogues for R_e.)

3. If the number of elements from
S conditioned to R at the start of an
experiment is less than S_r, the probability
of R will increase until the equilibrium
value is reached.

4. If all elements originally conditioned
to R belong to the sub-population
S_r, then the curve relating probability
to number of reinforcements will be
identical with equation (3’) except for
the asymptote, which will be S_r/S
rather than unity.

5. If some of the elements originally
conditioned to R do not belong to S_r,
but x_0 is less than S_r, then the curve
relating probability to number of reinforcements
will rise less steeply at first
than equation (3’), and may even have
an initial positively accelerated limb.
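The equilibrium described in properties 1 to 3 can be illustrated by a simple numerical iteration. The Python sketch below is not the derivation omitted in the text; it is a mean-value approximation assumed here, in which each step applies the expected change in the two components of x, weighting the change in equation (9) by the current probability of R and the change in equation (11) by the probability of the competing class. The values S = 100, S_r = 60, q = 0.1, and the starting points are hypothetical.

```python
def mean_course(x_r0, x_o0, S=100.0, S_r=60.0, q=0.1, steps=500):
    """Iterate expected changes per response occurrence.

    x_r: elements of S_r conditioned to R.
    x_o: elements outside S_r conditioned to R, i.e. (x - x_r).
    Each step, R occurs with probability p = (x_r + x_o)/S; otherwise
    the competing class occurs. Returns the list of p values.
    """
    x_r, x_o = x_r0, x_o0
    probs = []
    for _ in range(steps):
        p = (x_r + x_o) / S
        probs.append(p)
        x_r += p * q * (S_r - x_r)      # change from equation (9), weighted by P(R)
        x_o -= (1.0 - p) * q * x_o      # change from equation (11), weighted by P(R_e)
    return probs

if __name__ == "__main__":
    rising = mean_course(5.0, 0.0)      # starts below S_r/S
    falling = mean_course(60.0, 35.0)   # starts above S_r/S
    print(round(rising[0], 3), round(rising[-1], 3))
    print(round(falling[0], 3), round(falling[-1], 3))
```

Under these assumptions both courses settle at S_r/S, one from below and one from above, in line with properties 2 and 3. A start with no elements conditioned to R is omitted because R would then never occur in this approximation.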

It will be noted that from the present 
point of view, conditioning and extinc- 
tion are regarded simply as two aspects 
of a single process. In practice we 
categorize a given experiment as a 
study of conditioning or a study of 
extinction depending upon which be- 
haviors are being recorded. It seems 
quite possible that both conditioning 
and extinction always occur concur- 
rently in any behavior system, and that 
the common practice of regarding them 
as separate processes is based more on 
tradition and the limitations of record- 
ing apparatus than upon rational con- 
siderations. In the present formula- 
tion, reinforcement is treated as a quan- 
titatively graded variable with “pure 
extinction” at one end of a continuum. 
Any portion of an S-population may be 
related to an R-class by experimental 
conditions which produce a correlation 
between stimulus sampling and R- 
occurrences. Under given conditions 
of reinforcement an R-class may increase
or decrease in probability of occurrence
over a series of trials depending
upon whether the momentary probability
is less than or greater than the
equilibrium value for those conditions. 


DISCUSSION 


The foregoing sections will suffice to 
illustrate the manner in which problems 
of learning may be handled within the 
framework of a statistical theory. The 
extent to which the formal system de- 
veloped here may be fruitfully applied 
to interpret experimental phenomena 
can only be answered by a considerable 
program of research. A study of con- 
current conditioning and extinction of 
simple skeletal responses which realizes 
quite closely the simplified conditions 
assumed in the derivations of the pres- 
ent paper has been completed, and a 
report is now in press. Other papers 
in preparation will apply this formula- 
tion to extinction, spontaneous recovery, 
discrimination, and related phenomena. 

The relation of this program to con- 
temporary theories of learning requires 
little comment. No attempt has been 
made to present a “new” theory. It 
is the purpose of our investigation to 
clarify some of the conceptions of learn- 
ing and discrimination by stating im- 
portant concepts in quantitative form 
and investigating their interrelation- 
ships by mathematical analysis. Many 
similarities will be noted between func- 
tions developed here and “homologous” 
expressions in the quantitative formu- 
lations of Graham and Gagné (3) and 
of Hull (6). A thorough study of those 
theories has influenced the writer’s 
thinking in many respects. Rather 
than build directly on either of those 
formulations, I have felt it desirable to 
explore an alternative point of view 
based on a statistical definition of en- 
vironment and behavior and doing 
greater justice to the theoretical views 
of Skinner and Guthrie. A statistical 


WituiaM K. Estes 


theory seems to be an inevitable de- 
velopment at the present stage of the 
science of behavior; agreement on this 
point may be found among writers of 
otherwise widely diverse viewpoints, 
e.g., Brunswik (1), Hoagland (5), 
Skinner (7), and Wiener (9). It is to 
be expected that with increasing rigor 
of definition and continued interplay 
between theory and experiment, the 
various formulations of learning will 
tend to converge upon a common set of 
concepts. 

It may be helpful to outline briefly 
the point of view on certain contro- 
versial issues implied by the present 
analysis. 

Stimulus-response terminology. An 
attempt has been made to overcome 
some of the rigidity and oversimplifica- 
tion of traditional stimulus-response 
theory without abandoning its principal 
advantages. We have adopted a defini- 
tion of stimulus and response similar to 
Skinner’s (7) concept of generic classes, 
and have given it a statistical interpre- 
tation. Laws of learning developed 
within this framework refer to behavior 
systems (as defined in the introductory 
section of this paper) rather than to 
relations between isolated stimulus- 
response correlations. 

The learning curve. This investi- 
gation is not intended to be another 
search for “the learning function.” 
The writer does not believe that any 
simple function will be found to ac- 
count for learning independently of 
particular experimental conditions. On 
the other hand, it does seem quite pos- 
sible that from a relatively small set of 
definitions and assumptions we may be 
able to derive expressions describing 
learning under various specific experi- 
mental arrangements. 

Measures of behavior. Likelihood of
responding has been taken as the primary
dependent variable. Analyses presented
above indicate that simple relations
can be derived between probability
and such common experimentally
obtained measures as rate of responding 
and latency. 

Laws of contiguity and effect. Avail- 
able experimental evidence on simple 
learning has seemed to the writer to 
require the assumption that temporal 
contiguity of stimuli and behavior is a 
necessary condition for the formation 
of conditional relations. At the level 
of differential analysis, that is of laws 
relating momentary changes in behav- 
ior to changes in independent variables, 
no other assumption has proved neces- 
sary at the present stage of the investi- 
gation. In order to account for the 
accumulation of conditional relations in 
favor of one R-class at the expense of 
others in any situation, we have ap- 
pealed to a group of experimental op- 
erations which are usually subsumed 
under the term “reinforcement” in cur- 
rent experimental literature. Both 
Guthrie’s (4) verbal analyses and the 
writer’s mathematical investigations indicate
that an essential property of reinforcement
is that it ensures that successive
occurrences of a given R will be
contiguous with different samples from 
the available population of stimuli. We 
have made no assumptions concerning 
the role of special properties of certain 
after-effects of responses, such as drive- 
reduction, changes in affective tone, etc. 
Thus the quantitative relations devel- 
oped here may prove useful to investi- 
gators of learning phenomena regardless 
of the investigators’ beliefs as to the 
nature of underlying processes. 


SUMMARY 


An attempt has been made to clarify 
some issues in current learning theory 
by giving a statistical interpretation to 
the concepts of stimulus and response
and by deriving quantitative laws that
govern simple behavior systems. Dependent
variables, in this formulation,
are classes of behavior samples with
common quantitative properties; independent
variables are statistical distributions
of environmental events.
Laws of the theory state probability 
relations between momentary changes in 
behavioral and environmental variables. 

From this point of view it has been 
possible to derive simple relations be- 
tween probability of response and sev- 
eral commonly used measures of learn- 
ing, and to develop mathematical ex- 
pressions describing learning in both 
classical conditioning and instrumental 
learning situations under simplified con- 
ditions. 

No effort has been made to defend 
the assumptions underlying this formu- 
lation by verbal analyses of what 
“really” happens inside the organism 
or similar arguments. It is proposed 
that the theory be evaluated solely by 
its fruitfulness in generating quantita- 
tive functions relating various phenom- 
ena of learning and discrimination. 


REFERENCES 


1. Brunswik, E. Probability as a determiner
of rat behavior. J. exp. Psychol., 1939,
25, 175-197.

2. Estes, W. K. Effects of competing reactions
on the conditioning curve for
bar-pressing. J. exp. Psychol. (in
press).

3. Graham, C. H., & Gagné, R. M. The acquisition,
extinction, and spontaneous recovery
of a conditioned operant response.
J. exp. Psychol., 1940, 26,
251-280.

4. Guthrie, E. R. Psychological facts and
psychological theory. Psychol. Bull.,
1946, 43, 1-20.

5. Hoagland, H. The Weber-Fechner law
and the all-or-none theory. J. gen.
Psychol., 1930, 3, 351-373.

6. Hull, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

7. Skinner, B. F. The behavior of organisms.
New York: Appleton-Century, 1938.

8. Spence, K. W. The nature of theory construction
in contemporary psychology.
Psychol. Rev., 1944, 51, 47-68.

9. Wiener, N. Cybernetics. New York:
Wiley, 1948.


[MS. received July 15, 1949]








BEHAVIOR, PERCEPTION AND THE PSYCHOPHYSICAL 
METHODS * 


BY C. H. GRAHAM 
Psychological Laboratory, Columbia University 


In a forthcoming review (Graham, 
10), it is stated that the “psychophysi- 
cal” methods constitute one of the most 
fruitful ways of obtaining data that, 
formulated in stimulus-response terms, 
may provide the ensemble of quantita- 
tive relations necessary to define and 
explain the field of visual perception. 
When procedures other than psycho- 
physical methods are used, and in par- 
ticular, when responses are not properly 
restricted by “instruction stimuli,” the 
data obtained are in terms of “intro- 
spective” reports that are neither easily 
interpretable quantitatively nor clas- 
sifiable at a level higher than social 
conversation. 

The observable variables of the psy- 
chophysical experiment are responses on 
the one hand, and stimulus operations 
on the other. The stimulus-response 
function represented by a given psycho- 
physical curve describes how differen- 
tial, restricted responses correlate with 
such stimulus operations as the presen- 
tation of systematically varied objects, 
words, and energies. Such data, it is 
asserted, are discrimination data. 

The validity of the position repre- 
sented by these statements may or may 
not be obvious. In any case the data 
of the psychophysical experiment in- 
volve, at various levels of analysis, con- 
siderable degrees of complexity, and 
further supporting discussion may be 
desirable. The present account is concerned,
then, with a consideration of the
stimulus operations and responses encountered
in the various types of psychophysical
experiment. A later part
of the discussion will consider some
theoretical aspects of the ensemble of
relations that specifies the field of visual
perception.


* This account was prepared under Project
NR 142-404; Contract N6onr-271, Task Order
IX, between Columbia University and the
Office of Naval Research, U. S. Navy. Reproduction
in whole or in part permitted for
any purpose of the United States Government.


GENERAL


The history of the psychophysical 
methods (Titchener, 35; Boring, 4) is 
a history of the development of a meth- 
odology for specifying relations between 
stimuli and what, in the opinions of dif- 
ferent workers, were, variously: sensa- 
tion (Fechner, 6, pp. 1-12), sense dis- 
tance (Titchener, 34, pp. xxi—xxvii; 
35, pp. cxvi-cxliv), error (Fullerton and 
Cattell, 8, pp. 23-26), and response 
(Johnson, 22, 23; Graham, 9; Berg- 
mann and Spence, 3). In the writer’s 
opinion the presumed variables of sensa- 
tion, sense distance and error have little 
meaning except in terms of a response 
definition or reference, and the long 
controversial history of the problem 
seems to bear out this conclusion by 
high-lighting inconsistencies that devel- 
oped in the absence of response speci- 
fication. 

Elaborate quantitative procedures 
have been worked out for the stimulus- 
response relations of the psychophysical 
experiment. The literature on this topic 
is voluminous, and the reader is re- 
ferred to Titchener (35), Woodworth 
(39, chapters 17 and 18), Guilford (14) 
and Johannsen (21) for extended treat- 
ments. 

An operational description of a psy- 
chophysical experiment requires a state- 
ment of an instruction stimulus, and 
very often such a statement is sufficient
to provide reliable data. Nevertheless,
the advancement of psychological sci- 
ence requires not only that the instruc- 
tion stimulus be stated but that its 
effects be understood. The instruc- 
tion stimulus may be thought of as the 
“selector” of a given class of discrimina- 
tions, and it is an important parameter 
of any relation existing between re- 
sponses and their associated aspects of 
stimuli. The stimulus-response formu- 
lae resulting from psychophysical ex- 
periments must remain incomplete so 
long as appropriate instruction-stimulus 
parameters remain unanalyzed and in- 
completely specified (as, e.g., at the 
level of restating an original wording). 

A stimulus may be defined as some 
energy function that is correlated with 
a response. The particular energy func- 
tion specific to a given stimulus-response 
relation depends upon a great many circumstances.
In certain cases, for example,
in the matter of visual threshold, the
relevant energy function is intensity, a 
variable that may be specified within 
the theoretical system provided by 
physics. In other cases, the energy 
functions are not fruitfully described in 
physical terms; when response is not 
an easily expressible function of a 
physical aspect of a stimulus but seems 
to depend simply on the presence or 
absence of a given class of events, some 
other method of classification may be 
proper. For example, it may be more 
immediately fruitful to consider the 
energy functions as words, objects, etc. 

Consider an example in the field of 
aesthetics. One may order a group of 
pictures according to the frequencies of 
occurrence of the responses “Pleasant” 
and “Unpleasant”; and in fact the 
presentation of each picture may be 
considered to be a stimulating opera- 
tion that relates to a response quantified 
according to its frequency of occurrence. 
However, it must be observed that the 
quantified response is not an easily recognizable
function of a quantified aspect
of stimulus. In fact, the present
basis for classifying such stimuli is 
in terms of the responses that they 
elicit (i.e., the pictures are classified 
as “pleasant” or “unpleasant”). Ex- 
periments of this sort are usually of 
the type that is concerned with the 
question of reproducibility of response 
quantities within a population of sub- 
jects. The method of paired compari- 
son (Guilford, 14), which will not be 
discussed here, is most appropriate for 
such investigations. Obviously in this 
type of work it is of first importance to 
conduct a program on the nature of the 
effects produced by the stimulus opera- 
tions and their interaction with “con- 
ditions” of the organism determined by 
instruction. The program, of course, 
may result in analysis different from 
that traditional to the field of percep- 
tion, but there is no reason for restrict- 
ing one’s efforts to traditional paths 
when an answer to a problem is at 
issue. 

In the descriptions that follow, em- 
phasis will be placed on applications of 
the psychophysical methods to experi- 
ments in which the stimulus variable is 
not classified on a response basis. In 
addition, we shall pay attention to situ- 
ations that make use of a maximum of 
response restriction, i.e., that allow for 
a minimum of response categories. For 
example, the treatment of the method 
of constant stimuli with more than two 
response categories is not considered. 


THE METHOD OF CONSTANT STIMULI


The experiment on the determination 
of visual threshold may be used as an 
example of the application of this 
method. The dark adapted subject is 
instructed to maintain regard on a fixa- 
tion light and to report “Yes” if, after 
a signal, he discriminates another light 
in the field, and “No” if he does not. 
The method involves a determination
of how the frequency of occurrence of
one of the two permissible, contrasting 
responses (the “Yes” response) varies 
with changes in the intensity of stimu- 
lus light. Thus, for visual threshold, 
one may plot the percentage of occurrence
(P_p) of the positive response
“Yes” against some function of the
intensity (e.g., the logarithm) over a
small range of intensities. Above the
critical range the response occurs 100
per cent of the time, and below the
range the response occurs at a zero (or
“chance guessing”) level. When two
categories of response are used, the percentage
of occurrence of the negative
response (“No”) is simply 100 - P_p,
where P_p is the percentage of occurrence
of the positive response.

In this type of experiment one wishes 
to know the value of the stimulus in- 
tensity function that corresponds to a 
given percentage of occurrence of the 
positive response, i.e., the threshold. 
The particular percentage chosen may 
be arbitrary, but it will probably be 
determined by the statistical hypothe- 
sis that is being tested. For example, 
the mean threshold stimulus value cal- 
culated on the basis of a Gaussian dis- 
tribution is determined at the 50 per 
cent point of the cumulative function, 
whereas the per cent point for the mean 
stimulus value for a Poisson distribu- 
tion (Fry, 7) varies with other parame- 
ters of the function (e.g., the second 
moment) and approaches 50 only as 
a limiting value. In either case theo- 
retical assumptions are made with re- 
spect to the relationship between stimu- 
lus and response, and the experimental 
data provide a test of theory (Mueller, 
27). 

The constant stimulus method may be
extended to the intensity discrimination
experiment. A common form of this
experiment presents the subject with a
field illuminated by an intensity I. The
subject becomes light adapted to this
intensity and is instructed to report
“Yes” if, after a signal, he discriminates
a momentary brightening of the field
(provided by a variable, momentarily
added intensity, ΔI). If he does not
discriminate a momentary brightening,
he is to report “No.” Frequencies of
occurrence of “Yes” are plotted as a
function, for example, of log ΔI, and
the threshold is determined as previously.
(A curve in which the frequency
of occurrence of “Yes” responses
is plotted against an appropriate intensity
variable is sometimes called a
“frequency of seeing” curve.) An hypothesis
as to the form of the frequency
distribution will determine the threshold,
and indeed, it may also test an
hypothesis as to photochemical (Hecht,
Shlaer and Pirenne, 19) or behavior
activity. When the experiment is repeated
at various values of I, a series of
threshold ΔI values is obtained, and on
this basis one may plot an intensity
discrimination curve.

Figure 1 gives frequency of seeing
curves as functions of log ΔI at several
levels of adapting intensity, I. The
data are for subject R in Mueller’s experiment
(28). A frequency of seeing
curve was obtained twice at each value
of I, and the I value at which each
curve was determined is given as a
numerical label for each associated pair
of curves.

At a low value of I, the frequency of
seeing curve occurs at low values of
log ΔI; at a high value of I, the frequency
of seeing curve is displaced to
high values of log ΔI. The curves assume
intermediate positions for intermediate
values of I. Each curve presents
a typical sigmoid appearance.
The threshold value of ΔI, i.e., ΔI_t, may
be determined for each curve as the
ΔI value that occurs at a 50 per cent
frequency of seeing (or some other
fiducial percentage determined on an
appropriate theoretical basis).
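The reading of a threshold from a frequency of seeing curve can be sketched as follows. This Python example finds the log ΔI at which the percentage of “Yes” responses reaches 50 per cent by linear interpolation between adjacent points. Linear interpolation is an assumption made here for simplicity and stands in for whatever distribution hypothesis an investigator adopts; the data are hypothetical and the function name is introduced only for illustration.

```python
def threshold_log_delta_i(log_di, pct_yes, criterion=50.0):
    """Linearly interpolate the log delta-I at which 'Yes' reaches the criterion."""
    points = list(zip(log_di, pct_yes))
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 <= criterion <= p1 and p1 > p0:
            return x0 + (criterion - p0) * (x1 - x0) / (p1 - p0)
    raise ValueError("criterion not bracketed by the data")

# Hypothetical frequency-of-seeing data, invented for illustration only.
log_di = [2.8, 2.9, 3.0, 3.1, 3.2]
pct_yes = [5.0, 20.0, 45.0, 80.0, 100.0]

if __name__ == "__main__":
    print(threshold_log_delta_i(log_di, pct_yes))
```

A different fiducial percentage can be examined by changing `criterion`; fitting a cumulative Gaussian or Poisson function instead would be the natural refinement when a specific statistical hypothesis is under test.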





Fig. 1. Frequency of seeing curves for subject R in Mueller’s experiment (28). The number
associated with each pair of curves is log I in photons. Data were obtained by the method
of constant stimuli. [Ordinate: frequency, per cent. Abscissa: log ΔI (photons).]


The log ΔI_t values as determined in
the manner described may be used
to provide an intensity discrimination
curve. When each ΔI_t value is divided
by its corresponding adapting intensity
value, I, the Fechner ratio is given.
This ratio, or some function of it, e.g.,
its log, dictated by appropriate theoretical
considerations, may be plotted
against log I to give an intensity discrimination
curve. Such a curve is
presented in Fig. 2. This curve shows
that log (ΔI_t/I) is high at low values of
I and decreases to a final limiting value
at a high intensity.


Fig. 2. The intensity discrimination curve
obtained by Mueller (28). ΔI_t values are determined
at the fifty per cent frequency of
seeing value for each curve in Fig. 1. Log
(ΔI_t/I) is then computed and plotted against
log I. ΔI_t/I is high for low intensities and
decreases to a final limiting value at high intensities.
The curve drawn through the data
is the curve of equation (8) which represents
Hecht’s theory (17) of intensity discrimination.
[Ordinate: log ΔI_t/I. Abscissa: log I (photons).]


THE METHOD OF SINGLE STIMULI


In using this method (Wever and 
Zener, 38; Volkmann, 37) the experi- 
menter instructs the subject to respond 
in terms of a set number of categories 
to a single stimulus that may be varied 
in a given dimension. The simplest 
situation requires only two response 
categories, and under this condition the 
method is identical with the method of 
constant stimuli as used in the determi- 
nation of absolute thresholds. (See, 
for example, the experiment of Veniar 
[36] on difference thresholds for shape 
distortion of geometrical squares.) 

When three response categories are 
used, the method is comparable to the 
method of constant stimuli applied to 
the absolute threshold with an “Uncer- 
tain” category. 

It is only when more than three re- 
sponse categories are used that the 
method differs from the conventional 
constant method. Under such circumstances
the method of single stimuli
may become a “rating” method that 
entails some difficult problems of re- 
sponse specification. 


THE METHOD OF LIMITS


This method employs a procedure 
whereby the stimulus variable under 
consideration is varied in small con- 
secutive steps until a response change 
occurs. For example, in determining 
the absolute visual threshold, the in- 
tensity of the stimulus area is varied in 
small steps of increasing intensity, un- 
til the subject’s response changes from 
“No” to “Yes.” The intensity value 
(or, for example, its logarithm) used 
in computing the threshold is taken as 
a value interpolated between the value 
corresponding to the last negative re- 
sponse (“No”) and that corresponding 
to the first positive response (“Yes”).
The order of presentation of intensity
is reversed for the second determination
of the threshold datum, and the
final threshold is determined as an average
of all the data obtained in a number
of ascending and descending orders.
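
A minimal sketch of this computation follows. It rests on assumptions not fixed by the text: the interpolated value is taken as the midpoint of the two bracketing stimulus values, the stimulus values are log intensities, and the series shown are hypothetical.

```python
from statistics import mean


def crossover(levels, responses):
    """Value interpolated between the last stimulus before the response
    changes and the first stimulus after it (midpoint assumed)."""
    for i in range(1, len(levels)):
        if responses[i] != responses[i - 1]:
            return (levels[i - 1] + levels[i]) / 2
    raise ValueError("no response change occurred in this series")


# Hypothetical series of log intensities with the subject's responses.
ascending = ([-1.0, -0.8, -0.6, -0.4], ["No", "No", "Yes", "Yes"])
descending = ([-0.2, -0.4, -0.6, -0.8, -1.0], ["Yes", "Yes", "Yes", "Yes", "No"])

# One threshold datum per series; the final threshold is their average.
data = [crossover(*ascending), crossover(*descending)]
threshold = mean(data)
print(data, threshold)
```

In practice many ascending and descending series would be run, and the list `data` would hold one value for each.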

The method may be used for obtaining many types of threshold, including the intensity discrimination threshold. For example, in an experiment on the latter, the subject, light adapted to a given intensity, I, may be instructed to respond positively (“Yes”) to a discriminated momentary increase in intensity, ΔI, and negatively (“No”) to no change. The ΔI values corresponding to a change in response are found for successive series (in ascending and descending orders) and their frequencies may be plotted against ΔI or some function of ΔI (for example, log ΔI). The threshold value of the ΔI function is determined as an average of the distribution.

Figure 3A gives a distribution of ΔI values (each corresponding to a change in response) for a single subject at an adapting intensity, I, of 28,300 photons. This value of I is identical with the highest intensity of Fig. 1, and the ΔI_t values are obtained under similar conditions except for the method used: the present case employs the method of limits with ascending series only; the case of Fig. 1, the method of constant stimuli.

[Figure 3. Axes: frequency against log ΔI in photons; panels A, B, and C.]
Fig. 3. Curves giving distributions of ΔI values (each corresponding to a change in response) for a single subject at three adapting intensities. Method of limits, ascending series only. A: Data obtained experimentally at an I value of 28,300 photons; this I value is comparable to the highest I value of Fig. 1. B: A distribution computed for 330 photons by the method described in the text. C: A distribution computed for 0.033 photon. The data of A were provided by Dr. C. G. Mueller.

Figure 3B is a hypothetical distribution of ΔI's for an adapting intensity, I, of 330 photons. Figure 3C is a presumed distribution obtainable at 0.033 photon.² Each curve in Figs. 3B or 3C (or any other figure that might present a curve for an intervening value of I) provides an average value of log ΔI that is, by definition, a threshold value, i.e., log ΔI_t. Such values of log ΔI_t at various values of I give, on the appropriate calculation, an intensity discrimination curve comparable to that of Fig. 2.

Similar considerations as to the place 
of hypothesis in fitting data apply to 
this method as apply to the method of 
constant stimuli. 


THE METHOD OF ADJUSTMENT


In this method the subject is pre- 
sented with a constant, standard stimu- 
lus and a variable stimulus. He is in- 
structed to manipulate the variable 
stimulus until it “matches” (i.e., with 
respect to a given attribute it is indis- 
criminable from) the standard stimulus 
(Woodworth, 39, chapter 17). The 
average value of the stimulus variable 
resulting from many individual match- 
ings (adjustments) is taken as the 
“equality” value. The difference between the value of the standard stimulus variable and the “equality” value gives the “constant error,” a measure of “accuracy” of setting. The standard deviation of the stimulus values resulting from the adjustment responses gives an inverse measure of precision of setting.

² The curves of Figs. 3B and 3C have been drawn so that (a) the dispersion of each is proportional to the slope of the corresponding psychophysical function (i.e., for a comparable adapting intensity, I) of Fig. 1, and (b) the average ΔI is in appropriate ratio (comparable to that given in Fig. 1 for a corresponding I) to the experimentally determined value for 28,300 photons. (Photons = (10/π) × pupil area in square millimeters × brightness in millilamberts.)

It may be argued that any given 
adjustment involves a serialized set of 
discriminations involving different re- 
sponses and changing stimuli. For ex- 
ample, it is conceivable that when the 
subject makes a matching adjustment 
by a “bracketing” procedure, he pre- 
sents himself with a variable stimulus 
to which he makes implicit responses, 
analogous to the overt responses of the 
method of limits. If such is the case, 
the method of adjustment has much in 
common with the method of limits. 
Such a supposition is theoretical and 
calls for a specification of the response 
components of the adjustment. No 
data on the problem exist at present, 
and an answer can only be arrived at 
by observation of a subject in the psy- 
chophysical experiment. 

As yet, then, the method does not 
provide a true measure of a discrimi- 
nable difference since only one response 
category is used, i.e., adjustment; there- 
fore, no frequency of response or re- 
sponse change may be noted. However, 
classification of response may be ac- 
complished arbitrarily by the following 
device. Responses that lead to stimu- 
lus settings lying within plus or minus 
one standard deviation (or any other 
appropriate statistical fiducial limit) of 
the mean of the settings may be classed 
in the category “Equal.” Responses 
that give settings more than a stand- 
ard deviation below the mean are 
classed in the category “Less.” Re- 
sponses that give settings more than a 
standard deviation above the mean are 
classed in the category “Greater.” By 
this device, a so-called just discriminable difference (i.e., the standard deviation) may be obtained, but it is important to notice that the difference is not signalled by a true change in response, but rather by a classification of response on the basis of stimulus changes. All considerations of statistical “fits” of data that apply to the other methods apply to the method of adjustment.

[Figure 4. Axes: frequency against η in seconds of arc; panels A and B.]
Fig. 4. Frequency distributions of stereoscopic settings computed in terms of η, in seconds of arc. A: A distribution of settings for an intensity of 0.000091 millilambert. B: A distribution for an intensity of 186 millilamberts. The data are for a single subject in the experiment by Mueller and Lloyd (29).
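
The quantities named for the method of adjustment, and the arbitrary classification device, may be sketched as follows. The settings and the standard value are hypothetical, the names are invented for the illustration, the sample standard deviation is assumed, and the sign convention for the constant error ("equality" value minus standard) is likewise an assumption, since the text speaks only of a difference.

```python
from statistics import mean, stdev


def classify(setting, centre, spread):
    """Assign a setting to "Less", "Equal", or "Greater" according to
    whether it lies beyond one standard deviation from the mean."""
    if setting < centre - spread:
        return "Less"
    if setting > centre + spread:
        return "Greater"
    return "Equal"


# Hypothetical adjustment settings and standard stimulus value.
standard = 10.5
settings = [9.0, 10.0, 10.0, 11.0, 12.0, 10.0, 8.0]

equality_value = mean(settings)              # average of the matchings
constant_error = equality_value - standard   # sign convention assumed
precision_sd = stdev(settings)               # larger value, lower precision
labels = [classify(s, equality_value, precision_sd) for s in settings]
print(equality_value, constant_error, precision_sd, labels)
```

The labels are produced from the stimulus settings alone, which is the point made in the text: the categories are imposed by the experimenter and are not signalled by a change in the subject's response.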

Figure 4A presents data, obtained by the method of adjustment, for the determination of the least discriminable difference angle for stereoscopic vision, η_t.

The situation in which the observations were made involved the stereoscope with micrometer attachment described by Mueller and Lloyd (29). This stereoscope provides movement of a vertical line in the right-eye field, while the comparable line in the left-eye field remains stationary. These lines, when “fused,” provide the comparison stimulus. The standard stimulus is provided by “fusion” of three short, vertical fiducial lines for each eye. Movement of the micrometer causes the vertical comparison line in the right-eye field to move horizontally in the frontal stereoscopic plane and thus causes the “apparent” spatial position of the comparison stimulus to vary with respect to the standard stimuli.

On theoretical grounds (Graham, 10) 
it may be shown that a given setting of 
the micrometer corresponds to a dif- 
ference in the convergence angles exist- 
ing for the standard and comparison 
stimuli. This difference angle is called
η; for a true “lining up” of standard
and comparison stimuli its value is
zero. Negative values of η mean a setting
made nearer the subject than the
true setting for “equality”; positive
values, a setting farther from the subject
than the true setting.

The distribution of η's for a low light intensity (0.000091 millilambert) is shown in Fig. 4A for Subject CGM of the Mueller and Lloyd (29) experiment. Figure 4B gives the distribution of settings for the same subject at the highest intensity used, 186 millilamberts. (In Mueller and Lloyd's experiment, from which these data come, distributions of η's were obtained at 8 intensities intervening between the two discussed here.) The threshold difference angle, η_t, is taken to be some measure of the variability of the η settings. In the present case the average deviation was used. η_t is the conventional inverse measure of stereoscopic acuity.





[Figure 5. Axes: η_t in seconds against field brightness in millilamberts.]
Fig. 5. The threshold, η_t, for stereoscopic vision, in seconds, as a function of field brightness in millilamberts. η_t is calculated for each value of intensity as the average deviation of a distribution similar to those given in Fig. 4A and 4B. The curve, the one obtained by Mueller and Lloyd (29), is the curve of equation (8) and represents Hecht's description (17). η_t is taken on theoretical grounds (29) to be equal to ΔI/I.


When a threshold difference angle, η_t, is determined for 10 distributions of η (each distribution for a different intensity) and plotted against the logarithm of the prevailing intensity I, the graph of Fig. 5 is obtained. This graph shows the way in which η_t, a measure of stereoscopic acuity defined in terms of a geometrical theory of space perception, varies as a function of the prevailing intensity, I. η_t is high (i.e., acuity is low) at low intensities and decreases as intensity increases, until, at high intensities, it reaches a final, low limiting value. The experimental values on the graph are averaged data for two subjects.


THE METHOD OF SENSE RATIOS


The methods of constant stimuli and 
limits may be used to signal absolute 
thresholds and just discriminable dif- 
ferences; the method of adjustment may 
be used to signal “equivalence” of 
stimuli, and by an arbitrary statistical 
device, a defined just discriminable 
difference. The method of ratios 
(Delboeuf, 5; Stevens and Volkmann, 
33) has been used for the determination
of a stimulus quantity that results
from the instruction to adjust a stimu- 
lus so that it “bisects the interval” be- 
tween two reference stimuli. 

In the method of sense ratios the 
subject is presented with, for example, 
two illuminated patches, the patches 
differing in intensity of illumination 
(Merkel, 26). For any one experi- 
mental series the illumination of the 
patches is maintained constant at the 
given two levels. Between the two 
constant illuminated areas is another 
stimulus surface on which intensity of 
illumination is variable. The subject 
is instructed to respond “Yes” when he 
discriminates an intensity on the mid- 
dle area as “halfway” between the in- 
tensities on the outer areas. His re- 
sponse to all other intensities is to be 
“No.” 

Within a small range of intensities the 
frequency of response “Yes” is dis- 
tributed regularly, and the stimulus 
quantity for “bisection” may be deter- 
mined as the average intensity value of 
the distribution. As to the meaning of 
the quantity: whether, for example, it 
represents a point on a stimulus con- 
tinuum that corresponds to a signalling 
of half of something in the subject, that 
is a matter of theory. At any rate re- 
producible values are obtained; they 
have as parameters the intensities of 
the two comparison stimuli. 


THE REACTION TIME METHOD 


This method may be illustrated by a procedure followed by Steinman (31). The subject is light adapted to any one of a series of intensities, I. At any level of I, the subject is presented with various values of ΔI, and a determination of reaction time is made at each value of ΔI. By this procedure a curve may be obtained for reaction time as a function of ΔI at each value of I. The method gives a quantitative relation between stimulus and response, but, since only one response (that varies in reaction time) is involved, we cannot obtain a just discriminable difference.

However, by the method of plotting reaction time against log ΔI for each value of log I, one gets a family of curves (one for each value of log I) that are parallel on the log ΔI parameter. This treatment was not used by Steinman, but recomputation of her data shows that it would probably be a true description if her log I range were increased by 1 to 2 log units and probably for ΔI durations of less than 0.03 second (Keller, 24). By cutting the curves at constant reaction times one may obtain, for a given reaction time, a value of ΔI, ΔI_r, for a given value of I. Depending on the reaction time chosen, one may then plot a conventional intensity discrimination curve in which log (ΔI_r/I) is plotted against log I, reaction time being the parameter that sets the ordinate position of each curve. Thus a certain stimulus value may be referred to some constant response in a manner comparable to that encountered in the constant stimulus method, when difference thresholds, each referable to the same constant response occurrence, are determined as functions of a given parameter (e.g., adapting intensity, etc.).
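
The operation of cutting a family of reaction-time curves at a constant reaction time may be sketched as follows. All numbers are hypothetical and are not Steinman's data; the function name is invented; reaction time is assumed to fall steadily as log ΔI rises, and linear interpolation between measured points is assumed.

```python
def log_increment_at(log_delta_i, reaction_time, criterion):
    """Interpolate the log increment at which the curve of reaction time
    against log increment crosses the criterion reaction time.
    Reaction time is assumed to decrease as the increment grows."""
    for i in range(1, len(reaction_time)):
        slow, fast = reaction_time[i - 1], reaction_time[i]
        if fast <= criterion <= slow and slow > fast:
            frac = (slow - criterion) / (slow - fast)
            return log_delta_i[i - 1] + frac * (log_delta_i[i] - log_delta_i[i - 1])
    raise ValueError("criterion lies outside the measured reaction times")


# Hypothetical curves: log I -> (log increments, reaction times in msec).
curves = {
    1.0: ([0.0, 0.5, 1.0], [400.0, 300.0, 250.0]),
    3.0: ([1.5, 2.0, 2.5], [400.0, 300.0, 250.0]),
}

criterion_rt = 350.0
# One point of the discrimination curve per adapting intensity:
# (log I, log of the increment at the criterion minus log I).
discrimination = [
    (log_i, log_increment_at(xs, rts, criterion_rt) - log_i)
    for log_i, (xs, rts) in sorted(curves.items())
]
print(discrimination)
```

Choosing a different value of `criterion_rt` would shift each point vertically, which corresponds to the statement that reaction time is the parameter setting the ordinate position of each curve.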

The method has been used by Hecht 
(16) in experiments on Mya; experi- 
ments on human subjects have been 
few but promising (Woodworth, 39). 
Hartline (15), Graham and Riggs (12) 
and Graham and Hartline (11) have 
used a similar method with measures 
other than reaction time in experiments 
on electrical recording from the eyes 
of lower organisms. 


SOME THEORY


An elaborate theoretical structure has arisen on the base provided by the psychophysical methods and has developed along the following lines. Fechner (6) started with the questionable generalization, based on Weber's lifted weight experiments, that the just discriminable increment in weight is proportional to the reference weight; i.e., ΔW_t ∝ W. He then proposed, on the basis of other evidence, that the relation holds, not only for lifted weights, but for other just discriminable aspects of stimuli and for other sensory systems: i.e., in general, ΔS_t ∝ S, where ΔS_t is the just discriminable increment in stimulus and S is the reference stimulus. Thus ΔS_t/S = C, where C is a constant. The latter expression is called Weber's law, an empirical generalization that we now know to be invalid for large ranges of S.

Fechner carried the empirical approximation to a theoretical level by making the following assumption. Let ΔR, an hypothesized constant increment in sensation, be proportional to ΔS_t and inversely proportional to S. In these terms, ΔR ∝ C (= ΔS_t/S); and Weber's law is a measure of sensation by the concept that C is proportional to ΔR. Suppose, now, that ΔR and ΔS_t may be considered so small (certainly ΔS_t is not) that they may be written as the differentials dR and dS. Then dR = k dS/S, where k is the constant of proportionality relating C and ΔR. On integration, we obtain the expression

$$R = k \log (S/S_0). \tag{1}$$

Equation (1), which states a relation between R (sensation) and S (stimulus), is Fechner's law, in which S_0 is the threshold. For a treatment that has certain advantages over the present one, see Wright (40).
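
The integration step may be written out as follows. The sketch assumes that sensation is zero at the threshold stimulus S_0, which fixes the constant of integration, and that the base of the logarithm is absorbed into k; neither assumption is stated explicitly in the passage above.

```latex
\begin{align*}
dR &= k\,\frac{dS}{S}
   && \text{(differential form of the assumption)} \\
\int_{0}^{R} dR' &= k \int_{S_0}^{S} \frac{dS'}{S'}
   && \text{(assuming } R = 0 \text{ at } S = S_0\text{)} \\
R &= k\,(\log S - \log S_0) = k \log\frac{S}{S_0}
   && \text{(Fechner's law)}
\end{align*}
```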

An interesting aspect of the testing of Fechner's law was concerned with experiments on the “bisection of sense distances.” Consider two standard stimuli, S_1, the greater, and S_2, the smaller. A test stimulus, S_b, must be adjusted by the subject until it seems to be “half-way” between S_1 and S_2. Now, by Fechner's law,

$$R_1 = k (\log S_1 - \log S_0), \tag{2}$$

$$R_2 = k (\log S_2 - \log S_0), \tag{3}$$

and

$$R_b = k (\log S_b - \log S_0). \tag{4}$$

On subtracting equation (4) from (2), we have

$$R_1 - R_b = k (\log S_1 - \log S_b); \tag{5}$$

and, similarly, on subtracting (3) from (4), we have

$$R_b - R_2 = k (\log S_b - \log S_2). \tag{6}$$

For bisection, R_1 − R_b = R_b − R_2; and we may set the right hand side of (5) equal to the right hand side of (6). The result is:

$$\log (S_1/S_b) = \log (S_b/S_2),$$

$$S_b = \sqrt{S_1 S_2}; \tag{7}$$

that is, the bisection threshold is the geometric mean of the two comparison stimuli. The agreement of data with prediction is not good, at least for vision (Titchener, 35).
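
The prediction can be checked numerically. In the sketch below the stimulus values, the threshold, and the constant k are hypothetical, and the function names are invented for the illustration; the arithmetic mean is shown only for contrast.

```python
import math


def sensation(s, s0, k=1.0):
    """Fechner's law: R = k log(S / S_0). Values of k and S_0 are hypothetical."""
    return k * math.log(s / s0)


def predicted_bisection(s1, s2):
    """Bisecting stimulus predicted by Fechner's law: the geometric mean."""
    return math.sqrt(s1 * s2)


s0 = 1.0                 # hypothetical threshold stimulus
s1, s2 = 100.0, 4.0      # hypothetical greater and smaller standards
sb = predicted_bisection(s1, s2)

upper_step = sensation(s1, s0) - sensation(sb, s0)
lower_step = sensation(sb, s0) - sensation(s2, s0)
arithmetic_mean = (s1 + s2) / 2
print(sb, arithmetic_mean, upper_step, lower_step)
```

The two sensation steps come out equal at the geometric mean, whereas the arithmetic mean of the same standards is a much larger stimulus value; the two candidate predictions are therefore easy to separate experimentally.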

Recent criticisms serve to emphasize 
objections to Fechner’s position that 
have been proposed at many stages of 
the history of psychophysics. For ex- 
ample, Johnson (23) points out some 
logical inconsistencies in the Fechnerian 
formulation. Johnson (22) and Gra- 
ham (9) emphasize the problem of re- 
sponse in the psychophysical experiment 
and the kind of behavior functions that 
must be considered. Bergmann and 
Spence (3) show that scaling methods 
result in the scaling of stimuli. There 
is little need to go into all of the tech- 
nical objections to Fechner’s law, be- 
cause of the primary fact that it fails 
at its very roots: ΔS_t/S is not constant; nor has it ever been seriously taken to describe the action of all stimulus aspects. (For example, it has never been considered to apply to hue discrimination.) Later experiments, particularly in vision, show the lack of constancy of ΔS_t/S plainly. In Fig. 2, if Fechner's law were true, log (ΔI_t/I) should be constant across the graph, and the curve would be a horizontal line. Of course it is not, and similar disagreements with theory have been shown by many experimenters since the days of König and Brodhun (25). Hecht (17) has predicted that the equation for visual intensity discrimination should be

$$\frac{\Delta I_t}{I} = c\left[1 + \frac{1}{(KI)^{1/2}}\right]^{2}, \tag{8}$$

where c and K are constants; ΔI_t, the discriminable increment in intensity; and I, the reference intensity. The equation does, in fact, fit intensity discrimination data (as it does in Fig. 2) when certain experimental conditions are fulfilled. The equation is derived, on theoretical grounds, from a consideration of photochemical processes in the eye. (See also K. E. Baker [2] for an analysis of the hypothesis in relation to vernier acuity and H. D. Baker [1] for an extension to light adaptation.)
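
The shape of such a curve may be illustrated numerically. The sketch assumes the form commonly given for Hecht's equation, ΔI/I = c[1 + 1/(KI)^(1/2)]², and the values chosen for c and K are hypothetical rather than fitted to any data; the function name is likewise invented.

```python
import math


def hecht_fraction(intensity, c, k):
    """Increment fraction under the assumed form c * (1 + 1/sqrt(K*I))**2."""
    return c * (1.0 + 1.0 / math.sqrt(k * intensity)) ** 2


c, k = 0.01, 1.0                                # hypothetical constants
log_intensities = [-2.0, 0.0, 2.0, 4.0, 8.0]
curve = [
    (log_i, math.log10(hecht_fraction(10.0 ** log_i, c, k)))
    for log_i in log_intensities
]
for log_i, log_fraction in curve:
    print(f"log I = {log_i:5.1f}   log fraction = {log_fraction:7.3f}")
```

Under this form the logarithm of the fraction falls steadily as intensity rises and approaches log c as a limit, which is the qualitative behavior described for Fig. 2, in contrast with the horizontal line that a constant fraction would require.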


SOME BROAD CONSIDERATIONS


It is probably true that psychological theory cannot be psychophysics on a grand scale, nor can one look for universal principles in psychophysical concepts. Rather, it seems that the application of psychophysical method is a first step in the setting up of conditions that allow for a scientific description of a stimulus-response relation. Properly conceived, it specifies variables. It also poses a relation between variables requiring explanation.

We may take it that the objective of a science of behavior is the explication of the relations implicit in the equation

$$R = f(a, b, c, d, \ldots, n, \ldots, t, \ldots, x, y, z). \tag{9}$$

In this equation, the first letters of the alphabet (a, b, c, etc.) refer to properly specified aspects of stimuli; the last letters (... x, y, z), to properly specified conditions of the organism; R, to response; n, to number of presentations; and t, to time. In what follows, the letters F, φ, f_1, and f_2 imply a functional relation between the left hand term of the particular equation referred to and the terms in parentheses.

Any psychophysical function may be described as a special case of (9); i.e.,

$$R = F(a), \tag{10}$$

where all the variables of (9), except R and a, are constant. The effects of instruction stimuli are presumably specified by constant values of x, y, or z. A full specification of these latter terms may involve other cases of equation (9) that describe conditions due to verbal stimulation and their effects in time, t.

When an experiment requires the determination of several psychophysical functions to give a particular “perceptual” function, as in the case of intensity discrimination, we note how one aspect, b, of the stimulus situation (e.g., the threshold increment in intensity, ΔI_t) varies as a function of another, a (e.g., the adapting intensity, I). Thus, for constancy in all variables except a and b (but including R),

$$b = \phi(a). \tag{11}$$

For the case where a discrimination depends upon a “condition,” x, of the organism (e.g., “emotion”),

$$R = f_1(x). \tag{12}$$

Finally, in terms of equation (9), it is of some interest to observe that the case of learning reduces to

$$R = f_2(n), \tag{13}$$

where n refers to number of presentations of the constant stimuli a, b, etc., and other variables assume constant parametric values. “Reinforcement” is presumably a specified value of x, y, or z traceable to the action of properly specified but constant stimulus variables (i.e., the reinforcing agent).

It is readily enough seen that the 
precise specification of relations implied 
by equation (9) will require a program 
of research and theoretical analysis. 
Theories of sensory activity (Hecht, 
18; Granit, 13; Stevens and Davis, 32), 
that tell something of initiating events 
and limiting conditions, will no doubt 
be important in describing important 
relations; and behavior theories (Hull, 
20; Skinner, 30), that describe condi- 
tions of the organism, will be valuable. 
In any case, what must be attained is an 
explicit statement of the relations exist- 
ing among stimulus, response, and what- 
ever hypothesized variables, physiologi- 
cal and inferred, become logically necessary.
The only criterion of the value of
an hypothesized “condition” of the organism
centers on the question: Does
it play a fruitful rôle in a test of theory?


SUMMARY 


1. Attention is given to some prob- 
lems concerning stimulus and response 
in the psychophysical experiment. The 
role of instructions, classification of 
stimuli, and limitations of the subject’s 
verbal response are discussed. 

2. An examination is made of the 
stimulus-response relations of the fol- 
lowing psychophysical methods: con- 
stant stimuli, single stimuli, limits, ad- 
justment, sense ratios, and reaction 
time. Representative stimulus-response 
functions are described. It is concluded 
that the behavior evidenced in the psy- 
chophysical experiment is discrimina- 
tion. 

3. Some historical aspects of psycho- 
physical theory are considered and 
criticized. 

4. An attempt is made to relate the data of the psychophysical experiment and perception to the field of behavior theory. It is shown in equation (10) that psychophysical functions are describable as special cases of the general behavior function of equation (9). In the special case a specifiable aspect of a stimulus is the essential variable, and terms representing number of presentations of stimuli, time, and specifiable conditions of the organism are constant. In certain treatments that involve several psychophysical functions (e.g., intensity discrimination), one variable aspect of the stimulus is treated in equation (11) as a function of another aspect of the stimulus, the response and all other variables remaining constant.

REFERENCES

1. Baker, H. D. The course of foveal light adaptation measured by the threshold intensity increment. J. Opt. Soc. Amer., 1949, 39, 172-179.

2. Baker, K. E. Some variables influencing vernier acuity. J. Opt. Soc. Amer., 1949, 39, 567-576.

3. Bergmann, G., & Spence, K. W. The logic of psychophysical measurement. Psychol. Rev., 1944, 51, 1-24.

4. Boring, E. G. A history of experimental psychology. New York: D. Appleton-Century Co., 1929. Pp. xvi + 699.

5. Delboeuf, J. Eléments de psychophysique. Paris: Germer Baillière et Cie., 1883.

6. Fechner, G. T. Elemente der Psychophysik, Vol. I. Leipzig: Breitkopf und Härtel, 1889. Pp. xvi + 346.

7. Fry, T. C. Probability and its engineering uses. New York: D. Van Nostrand Co., 1928. Pp. xiv + 476.

8. Fullerton, G. S., & Cattell, J. McK. On the perception of small differences. Philadelphia: University of Penn. Press, 1892. Pp. 159.

9. Graham, C. H. Psychophysics and behavior. J. gen. Psychol., 1934, 10, 299-310.

10. Graham, C. H. Visual perception. In A handbook of experimental psychology (Ed., S. S. Stevens). New York: John Wiley. (In preparation.)

11. Graham, C. H., & Hartline, H. K. The response of single visual sense cells to lights of different wave lengths. J. gen. Physiol., 1935, 18, 917-931.

12. Graham, C. H., & Riggs, L. A. The visibility curve of the white rat as determined by the electrical retinal response to lights of different wave-lengths. J. gen. Psychol., 1935, 12, 279-295.

13. Granit, R. Sensory mechanisms of the retina. New York: Oxford Univ. Press, 1947. Pp. xxiii + 412.

14. Guilford, J. P. Psychometric methods. New York: McGraw-Hill, 1936. Pp. 566.

15. Hartline, H. K. Intensity and duration in the excitation of single photoreceptor units. J. cell. & compar. Physiol., 1934, 5, 229-247.

16. Hecht, S. Intensity discrimination and the stationary state. J. gen. Physiol., 1923-4, 6, 355-384.

17. Hecht, S. A theory of visual intensity discrimination. J. gen. Physiol., 1935, 18, 767-789.

18. Hecht, S. Rods, cones and the chemical basis of vision. Physiol. Rev., 1937, 17, 239-290.

19. Hecht, S., Shlaer, S., & Pirenne, M. H. Energy, quanta and vision. J. gen. Physiol., 1942, 25, 819-840.

20. Hull, C. L. Principles of behavior. New York: Appleton-Century, 1944. Pp. x + 422.

21. Johannsen, D. E. The principles of psychophysics with laboratory exercises. Saratoga Springs, New York, 1941. Pp. xvi + 243.

22. Johnson, H. M. Did Fechner measure introspectional sensation? Psychol. Rev., 1929, 36, 257-284.

23. Johnson, H. M. Some properties of Fechner's 'Intensity of Sensation.' Psychol. Rev., 1930, 37, 113-123.

24. Keller, M. The relation between the critical duration and intensity in brightness discrimination. J. exp. Psychol., 1941, 28, 407-418.

25. König, A., & Brodhun, E. Experimentelle Untersuchungen über die psychophysische Fundamentalformel in Bezug auf den Gesichtsinn. Sitzber. d. Akad. d. Wiss. Berlin, 1889, 641-644.

26. Merkel, J. Die Abhängigkeit zwischen Reiz und Empfindung. Phil. Stud., 1888, 4, 541-594.

27. Mueller, C. G. Numerical transformations in the analysis of experimental data. Psychol. Bull., 1949, 46, 198-223.

28. Mueller, C. G. Frequency of seeing curves at various adapting illuminations in intensity discrimination. (In preparation.)

29. Mueller, C. G., & Lloyd, V. V. Stereoscopic acuity for various levels of illumination. Proc. Nat. Acad. Sciences, 1948, 34, 223-227.

30. Skinner, B. F. The behavior of organisms. New York: Appleton-Century Co., 1938. Pp. ix + 457.

31. Steinman, A. R. Reaction time to change. Arch. Psychol., 1944, No. 292.

32. Stevens, S. S., & Davis, H. Hearing. New York: Wiley, 1938.

33. Stevens, S. S., & Volkmann, J. The relation of pitch to frequency: a revised scale. Amer. J. Psychol., 1940, 53, 329-353.

34. Titchener, E. B. Experimental psychology, Vol. II. Quantitative experiments. Part I. Student's manual. New York: Macmillan Co., 1927. Pp. xli + 208.

35. Titchener, E. B. Experimental psychology, Vol. II. Quantitative experiments. Part II. Instructor's manual. New York: The Macmillan Co., 1905. Pp. clxxi + 453.

36. Veniar, F. A. Difference thresholds for shape distortion of geometrical squares. J. Psychol., 1948, 26, 461-476.

37. Volkmann, J. The method of single stimuli. Amer. J. Psychol., 1932, 44, 808-809.

38. Wever, E. G., & Zener, K. E. The method of absolute judgment in psychophysics. Psychol. Rev., 1928, 35, 466-493.

39. Woodworth, R. S. Experimental psychology. New York: Henry Holt and Co., 1938. Pp. xi + 889.

40. Wright, W. D. Researches on normal and defective color vision. St. Louis: The C. V. Mosby Co., 1947. Pp. xvi + 383.

[MS. received July 18, 1949] 





DESCRIPTION OF THE LEARNED RESPONSE 
IN DISCRIMINATION BEHAVIOR 


BY HENRY W. NISSEN 


Yerkes Laboratory of Primate Biology and Yale University 


The descriptive task of science is pri- 
marily one of abstraction or classifica- 
tion. Irrelevant differences are set 
aside to make possible a grouping of 
the data according to major significant 
identities or similarities. The decision 
as to what differences are significant 
and which ones may be disregarded as 
incidental has far-reaching consequences 
which delimit the theoretical structure 
into which the facts may be eventually 
organized. Any systematization, erected 
prematurely on inadequate classifica- 
tions of the empirical data, will be 
wasteful. For psychology, which, as 
Northrop (9) points out, is still pre- 
dominantly in the second or natural 
history stage of science, the proper 
selection of descriptive categories is of 
fundamental present-day importance. 

Inclusion of all pertinent facts in a 
minimum number of classes has the 
obvious advantage of facilitating sub- 
sequent mathematical-theoretical ma- 
nipulations (7, 14). Its danger lies in 
the necessary exclusion of distinctions 
which may be even more important than 
those on which the actually used divi- 
sions are based. In psychology, reduc- 
tion in number of classes frequently has 
been carried to its penultimate extreme, 
as is exemplified by a variety of 
commonly used dichotomies: innate- 
acquired, subliminal-supraliminal, right- 
wrong, yes-no. 

The particular concern of this paper 
is with two sets of terms extensively em- 
ployed in reporting behavioral data 
from animal studies involving two- 
choice problem situations. They de- 
scribe the response as turning right 
versus turning left or, alternatively, as
approach versus avoidance. (These alternatives
have been designated as
“movements” or “motor patterns,” and 
“cathexes” or “acts,” by Tolman and 
Guthrie, respectively.) Usually both 
sets of terms are accepted, one or the 
other being applied according to the 
kind of situation under consideration 
(5, 13). Some writers use the two sets 
so indiscriminately, and even inter- 
changeably, as to imply synonymy. 
This identification is erroneous and may 
have serious consequences. Even intro- 
spective evidence suggests a critical dif- 
ference: “reaching towards the right” 
(when green happens to be at the right, 
red at the left), is not the same thing 
as “approaching (or reaching towards) 
green.” 

There are certain experiments in 
which differentiation is most conven- 
iently described in terms of discrete 
muscular movements—movements whose 
only spatial reference is to the subject’s 
own body. Such responses do not lend 
themselves readily to the approach- 
avoidance formulation. This would be 
the case, for instance, in an auditory 
discrimination in which the subject must 
make a left turn (or must press the 
telegraph key on the left) in response 
to one sound, and must make a right 
turn (press the right key) when a dif- 
ferent sound is heard. On the other 
hand, the difficulties and inconsistencies 
inherent in the left-right formulation 
when applied to other instances of dis- 
crimination behavior, especially in the 
visual field, have not been fully recog- 
nized (although some writers have 
found it convenient to make general 
use of the approach-avoidance classifi- 




cation). In the interest of simplifica- 
tion or economy, consequently, there 
has been a tendency to describe all two- 
choice discriminations as orientational 
(e.g., left-right) responses. The follow- 
ing two instances will serve to illustrate 
this trend. 

In 1938 Gulliksen and Wolfle (1) 
presented a provocative and mathemati- 
cally precise “theory of learning and 
transfer.” In this theory the response 
is defined “as being a jump to the right 
or a jump to the left, and the stimulus 
as consisting of the entire configuration 
presented on a given trial” (1, p. 244). 
It may be pointed out here 1) that there 
is no more a priori justification for this 
assumption than for the alternative one 
which defines the responses as approach 
to the positive and avoidance of the 
negative stimulus-object, and 2) that 
many of the predictions or deductions 
from a theory based on the second alternative
would be quite different from
those deriving from the Gulliksen and 
Wolfle formulation. 

More recently several writers have 
proposed neural structures, analogous 
to mechanical devices, as offering an 
explanation for the workings of the cen- 
tral nervous system in guiding and 
regulating behavior. Thus Householder 
(3) presents a fairly simple structure 
for discrimination, a rather more com- 
plicated one for transposition. In both 
cases the neurophysiological postulates 
are few in number and are, for present 
purposes, unobjectionable. But in the 
discrimination structure the resultant 
behavior is described as approach- 
avoidance, whereas in transposition it 
appears as jumping to the left or right. 
Since, by definition, transposition fol- 
lows discrimination without further 
training, and since discrimination is a 
requisite for transposition, it seems rea- 
sonable to expect and demand uniform- 
ity of the terms used in describing these 
interdependent functions. Perhaps ex- 


pediency, or a desire to make the dia- 
grams attractively simple, may have 
motivated the shift in basic terms. Dia- 
gramming discrimination in terms of 
left- and right-going responses would 
(according to my translation) require an 
increase of the theoretically minimum 
number of cells from 4 to 10. A struc- 
ture accounting for transposition in ap- 
proach-avoidance terms would necessi- 
tate no less than 28 cells with a great 
complexity of excitatory and inhibitory 
interconnections.¹ Of the two possibilities
for obtaining consistency of descriptive
terms, the latter seems preferable
if not necessary: according to
Householder’s unmodified transposition 
structure, a shift in the spatial arrange- 
ment of the stimuli from left-right to 
up-down would result in complete loss 
of the habit. And if the subjects were 
forced to look at the stimuli from an 
upside-down position, they should now 
respond positively to the stimulus which 
was negative during training. 

If an animal has learned a left-going 
response to the stimulus configuration 
WB, a right-going response to BW— 
if it has learned this and nothing more 
—then one would not expect consistent 
response² to W or B when the objects
are presented one above the other: 
W/B or B/W. If, instead, the animal 
has learned approach to W, avoidance 
of B, little disturbance would be ex- 
pected when the spatial relationship is 


1 This structure was worked out by Robert 
A. Blum and Josephine S. Blum of these 
Laboratories. In addition to the postulates 
used by Householder, it depends on precise 
timing relationships in the firing and refrac- 
toriness of the cells involved. The modified 
structure would account for complete transfer 
from the left-right to one other spatial ar- 
rangement (e.g. up-down), but additional 
similar nets would be necessary to permit con- 
sistency of response when the stimuli are 
placed in still other relationships.

2 Except, possibly, on the basis of a residual 
preference, present before training began, for 
W or B. 







thus changed.³ (Any minor disturbance
that did occur might be referred to one
of several “extrinsic” factors: emotional
upset following change in a familiar 
situation, shift in the habitual motor 
pattern, and so on.) The specific aim 
of the following experiment was to de- 
termine whether chimpanzees, trained in 
a given discrimination habit with the 
stimulus-objects in one spatial arrange- 
ment, then respond indiscriminately or 
consistently (to the previously rewarded 
object) when the discriminanda are pre- 
sented in an entirely different spatial 
relationship. 
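
The two descriptions lead to different predictions for this test, and a small sketch may make the difference concrete. The Python fragment below is illustrative only: the function names, the dictionary representation of the habit, and the coding of positions are assumptions introduced here and are not part of the experiment. It shows that a habit recorded as "total configuration elicits a movement" has no entry for a rotated display, whereas a habit recorded as "approach W, avoid B" selects the same object wherever it appears.

```python
# Hypothetical sketch: two ways of describing what was learned with
# white (W) rewarded and black (B) unrewarded in a left-right display.

# (a) Movement description: each total configuration elicits one movement.
movement_habit = {
    ("W", "B"): "left",   # W on the left, B on the right -> go left
    ("B", "W"): "right",  # B on the left, W on the right -> go right
}

# (b) Act description: the response is defined by the object approached.
approach_habit = {"W": "approach", "B": "avoid"}


def predict_by_movement(arrangement, layout):
    """Return the position chosen, or None if no learned movement applies."""
    if layout != "left-right":
        return None  # nothing was learned about up-down configurations
    return movement_habit.get(arrangement)


def predict_by_approach(arrangement, layout):
    """Return the position of the object that is approached."""
    positions = ("left", "right") if layout == "left-right" else ("up", "down")
    for stimulus, position in zip(arrangement, positions):
        if approach_habit[stimulus] == "approach":
            return position
    return None
```

Under description (a) the up-down test yields no prediction, which corresponds to indiscriminate responding; under description (b) the rewarded object is chosen in either arrangement.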


APPARATUS AND PROCEDURE 


Two sets of stimulus-plaques were used: 
1) a plain black (B) and a plain white (W) 
square surface; 2) a one-inch, black-edged 
red stripe, one running horizontally (H) 
and one vertically (V), each centered on a 
square tan surface. 

The two plaques (black and white, or 
horizontal and vertical) were presented in 
a box-like holder shown in the figure. The 
subjects, reaching with their hands through 
the wire mesh or bars of the cage, made 
their choices by pushing the plaque backwards
in the holder. This made accessible
the food-well which was baited or not ac- 
cording to correctness of choice. A small 
piece of banana or apple was used as re- 
ward. The holder could be presented so 
that the plaques were above and below each 
other, as shown in the figure, or, by rotat- 
ing 90 degrees, so that they were to the 
left and right of each other. Two sets 
of food-wells were available, as shown. 
The holder rested on a removable plat- 
form, fastened to the animal cage, and the 
distance of this platform from the floor 


3 The issue here under discussion is not 
directly concerned (a) with whether the cue 
is “spatial” or “non-spatial,” nor (b) with the 
question as to whether the “absolute” or 
“relative” properties of the stimulus-objects 
provide the effective cue. The problem of de- 
scribing the critical response as approach- 
avoidance or as discrete muscular movements 
remains, whatever may be the facts and their 
interpretation in respect to either of these 
questions. 




Representation of apparatus used for all 
learning and transfer trials. Position shown 
here presents stimulus-plaques B and W (black 
and white) in up-down relationship. Sheet 
metal edging (light inner lines) slightly over- 
laps stimulus cards, exposing a 474” square 
area. F=Food-well. Outside dimensions: 
64" wide, 12%” high, 9%” deep. 











was adjusted so as to provide minimum 
variation in height of the plaques when the 
holder was in up-down and left-right posi- 
tions. During baiting and changing of the 
stimuli, the holder was drawn back, out 
of reach of the animal, and for response 
it was pushed up against the cage. The 
plaques were in place at time of baiting 
and the experimenter always reached to- 
wards both food-wells, from behind, in a 
uniform order. Other precautions, includ- 
ing the baiting of both food-wells, insured 
that the subjects were not responding on 
the basis of secondary cues. The correction
technique was used, allowing a second
response (following an error) a few sec- 
onds after the first one. An experimental 
session consisted of 25 trials and there 
was usually one session per day. 







Prior to any specific training or learning 
sessions, each subject was given from 8 
to 16 adaptation trials in which he was 
allowed to take a piece of fruit from each 
of the four food-wells. A minimum of 
two, and a maximum of four responses to 
each food-well were permitted. During 
these adaptation trials “neutral” plain 
brown plaques were substituted for the 
stimulus-cards described above. 

Ten animals (Group A) were trained 
on a discrimination habit (7 of them with 
the stimuli in the left-right positions, 3 
with the up-down arrangement) and were 
then tested with the same stimuli presented 
in the other spatial arrangement. The 
tests were given after the subject had made 
not more than one error in two successive 
sessions of 25 trials each (i.e., 49 out of 


50 correct) with the initial set-up. It is 
to be noted that the subjects had had no 
prior training—at least in this experiment 
or with this apparatus—with the spatial 
arrangement used in the transfer tests. 
Depending on the accuracy of their per- 
formances, these animals were given from 
two to four sessions each (50 to 100 trials) 
in the transfer situation; i.e., until they
had again met the criterion of 49 correct 
responses in 50. Results are presented in 
Table 1. 
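
The criterion just described can be sketched in a few lines. The fragment below is a minimal illustration, assuming that a subject's record is kept as a list of errors per 25-trial session; the function name, its parameters, and the sample record are hypothetical and are not taken from the data. In keeping with the table captions, the sessions in which the criterion was met are excluded from the scores.

```python
def sessions_before_criterion(errors_per_session, max_errors=1, window=2):
    """Count sessions preceding the first run of `window` successive
    sessions whose combined errors do not exceed `max_errors`.

    Returns None if the criterion is never met.
    """
    for start in range(len(errors_per_session) - window + 1):
        if sum(errors_per_session[start:start + window]) <= max_errors:
            return start
    return None


TRIALS_PER_SESSION = 25

# Hypothetical record: errors in each successive 25-trial session.
record = [11, 8, 4, 2, 1, 0]
n_sessions = sessions_before_criterion(record)
learning_trials = n_sessions * TRIALS_PER_SESSION
learning_errors = sum(record[:n_sessions])
```

With `max_errors=2` the same function gives the Group B criterion of 48 correct in 50.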

Five chimpanzees (Group B) were 
trained on one discrimination habit in the 
left-right situation, on a different habit in 
the up-down arrangement, before being 
given any transfer tests. Having met the 
criterion of mastery (in this case 48 out 
of 50 correct) on the first habit (left-right 


TABLE 1

Learning, transfer, and savings scores for animals tested on transfer to either up-down
or left-right before having had any specific training experience with that spatial arrangement
(Group A). Learning and transfer scores do not include sessions in which criterion was met.

Discrimination; subject | Learning: Trials, Errors | Transfer: Trials, Errors | Percentage savings: Trials, Errors

(1) Training left-right; testing up-down

0 34 31 47
43 58 44
0 5 5
1

(2) Training up-down; testing left-right

W vs. B
Nira 250
B vs. W
Alf 75
H vs. V
Wendy 400

Mean (N = 10) 125

W = White; B = Black; H = Horizontal; V = Vertical.
* See text for explanation of Buff's learning scores.







TABLE 2

Learning, transfer, and savings scores for animals trained on one left-right and one up-down
discrimination before being tested for transfer to the alternative spatial arrangement
(Group B). Learning and transfer scores do not include sessions in which criterion was met.

Discrimination; subject | Learning: Trials, Errors | Transfer: Trials, Errors | Percentage savings: Trials, Errors

(1) Training left-right; testing up-down

W vs. B
Bokar (1) 50
Alpha (1) 12

B vs. W
Pati (1) 5 35
Wendy* (2) 8

V vs. H
Don (2) - 103

H vs. V
Ami (2) 150 46
Alf** (2) 125 40

25
50
0 0

(2) Training up-down; testing left-right

W vs. B
Don (1) 2 37
Ami (1) 125 42

B vs. W
Dina*** (1) 22

V vs. H
Pati (2) 20 61
Bokar (2) 2 76

H vs. V
Alpha (2) 78

133 | 47

Mean (N = 13) 11.5 97

W = White; B = Black; H = Horizontal; V = Vertical.
Figure in parentheses after subject’s name indicates whether this was first or second
habit learned.
* Wendy was trained in up-down and tested in left-right situation on H vs. V before this
training on B vs. W was begun. See Table 1.
** Alf was trained in up-down and tested in left-right situation on B vs. W before this
training on H vs. V was begun. See Table 1.
*** Dina had 225 trials on V vs. H in the left-right position, without meeting criterion or
making progress, before the transfer tests on B vs. W were given.


for 3 animals, up-down for the other 2), 
they were trained, to the same criterion, 
on the second habit. Then a single session, 
consisting of 12 successive trials on each 
of the two habits—each in its original 
spatial arrangement—was given before the 
transfer tests were begun. (No animal 
made more than one error in these final 
24 learning trials.) The tests were also 


given by the split-session method: 12 (or 
13) trials on the first habit in its alterna- 
tive spatial relationship, followed by 12 
(or 13) trials on the second habit in its 
alternative arrangement, until the criterion 
of 48 correct trials on each was met. Two 
or more sessions of transfer tests were 
given each subject. Results are shown in 
Table 2. 










Finally, two of the subjects of Group A 
(Alf, Wendy) were trained and then tested 
on a second habit after completion of training-testing
on the first. The results are included
in Table 2, since they represent
transfer after experience (on a different 
habit) in the test situation. Included here 
also are the results of Dina who had re- 
ceived training, but had not reached the 
criterion, on a previous stimulus-pair (V 
vs. H) presented in the spatial arrange- 
ment (left-right) of the transfer tests of 
the second (B vs. W) habit. 


RESULTS 


Results obtained with the Group A 
animals, who were tested before they 
had any training experience with the 
spatial arrangement of the specific 
transfer situation here used, are pre- 
sented in Table 1. Data of the Group 
B subjects, tested after experience with 
the transfer situation but in connection 
with a different stimulus-pair, are given 
in Table 2. In 10 of the 13 cases in- 
cluded in Table 2, the subject was tested 
“simultaneously” on each of two dis- 
criminations presented for transfer in a 
spatial arrangement other than that 
used during the original learning. 

The savings scores are large in all 
cases and give convincing evidence for 
a high degree of transfer from the left-right
to the up-down situation and vice versa.
In 13 of the 23 cases (Tables 1
and 2) transfer was complete and im- 
mediate (no errors). (Since the cri- 
terion of mastery is based on perform- 
ance within sessions of 25 trials, the 
scores usually show greater savings in 
errors than in trials.) 
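
The computation behind a savings score can be sketched as follows. The formula is not stated in this section, so the conventional definition is assumed here: the reduction from the learning score to the transfer score, expressed as a percentage of the learning score. The function name and the handling of a zero learning score are likewise assumptions made for illustration.

```python
def percentage_savings(learning_score, transfer_score):
    """Conventional savings score, assumed here:
    100 * (learning - transfer) / learning.

    Returns None when the learning score is zero, since the ratio is
    then undefined.
    """
    if learning_score == 0:
        return None
    return 100.0 * (learning_score - transfer_score) / learning_score
```

On this definition a case of complete and immediate transfer (no transfer errors) gives 100 per cent savings in errors, whatever the learning score.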

The data indicate that the V vs. H 
and H vs. V habits are more slowly 
learned, and produce more errors in 
transfer, than the W vs. B and B vs. W 
habits. No consistent differential ef- 
fects of spatial arrangement (left-right 
vs. up-down) are seen in the learning 
scores. There is a suggestion that trans- 
fer from left-right to up-down may be 




a little more difficult than the reverse; 
this could be explained by the fact that 
all subjects had had previous experience 
(in other experiments) with the left- 
right but not with the up-down place- 
ment of two choice-objects. 

Transfer errors seem to be associated 
with emotional tension and perhaps with 
age. Four of the subjects were about 
3 years old: Verb, Dehn, Hank, and 
Buff; all were exceptionally tense and 
three of them contributed the largest 
transfer error scores in Table 1 (5, 9 
and 15 errors, respectively). All other 
subjects were from 7 to about 30 years 
old, and among these the worst transfer 
scores were contributed by notably 
tense individuals (Bokar, Pati, Don, 
Ami), some of whom perhaps associated 
this apparatus with a rather similar one 
in which they recently had received 
electric shock. 

The performances of Buff and Hank 
deserve special mention. At the age of 
about six months, soon after they were 
able to walk, each of these animals was 
trained in a discrimination habit. The 
experimental set-up was quite different 
from that of the present study, requir- 
ing locomotion to the left or right, de- 
pending on which of two adjacent, cur- 
tained passage-ways was marked with 
a white and which one with a black 
stimulus-patch. Buff had mastered a 
white vs. black, Hank a black vs. white 
habit. (Note that these are the same 
stimulus-values shown in Table 1.) 
The present study was done approxi- 
mately 25 months after their last pre- 
vious experiences with this earlier situa- 
tion. It seems clear that Buff had re- 
tained the effects of his early training 
over this two-year interval; he re- 
sponded to white without exception in 
both learning and transfer trials. Hank
chose black in 21 of the first 25 trials, 
suggesting considerable retention. Dur- 
ing the next 3 sessions, however, he be- 
came very jittery and made chance 







scores, after which he improved rapidly. 
In the first transfer session Hank made 
13 errors, in the second session only 2. 

Four of the subjects had had brain 
operations some two years before: pre- 
frontal removals in the case of Scarf, 
Mars, and Nira, prefrontal lobotomy in 
the case of Ken. The transfer scores of 
all four animals are within the range of 
the other subjects. Ken and Nira were 
exceptionally slow in learning W vs. B; 
in both cases there was a long plateau in 
the learning curve at about the 85 per 
cent level of accuracy. 


DISCUSSION 


The specific question raised at the 
beginning of this paper has received a 
clear-cut answer: The fact that a visual 
discrimination habit learned in one 
spatial arrangement showed little or no 
disturbance when the stimulus-objects 
were presented in a quite different posi- 
tional relationship indicates that the 
original performance must be described 
as approach to (or avoidance of) abso- 
lute or relational visual properties of the 
stimuli, rather than as two discrete 
movements, differentially elicited by 
each of two distinct total configurations. 
The perfect transfer scores of Mars, for 
instance, are explicable if we describe 
his learning diagrammatically as

\[ S_{\text{white}} \rightarrow R_{\text{avoidance}} \]
\[ S_{\text{black}} \rightarrow R_{\text{approach}} \]

but not if we describe it as

\[ S_{\text{black-white}} \rightarrow R_{\text{left}} \]
\[ S_{\text{white-black}} \rightarrow R_{\text{right}} \]

This finding does not exclude the pos-
sibility that both sets of connections or
associations were established during the
learning period, but it does show that
approach-avoidance was independently
effective. The present results pertain
to particular chimpanzees with certain
background experiences, observed in a
specific experimental situation, and
tested after a given amount of over-
training. How generally they may hold


under other conditions remains to be 
discovered. The results obtained are 
consistent with common sense—to the 
layman the present study must appear 
as another demonstration of the obvious 
—and also with some earlier experi- 
mental work (8) which suggested that 
the critical differentiation in visual de- 
layed response is of the external field 
rather than of the required muscular 
responses. Nevertheless, we must also 
take into account the fact that some 
discrimination responses (see introduc- 
tion) are more directly describable in 
terms of discrete movements.⁴
Logically there seem to be three pos- 
sibilities: 1) By extensive experimenta- 
tion the immediate appropriateness of 
one or the other descriptive formulation 
may be determined for various animal 
forms, various discrimination situations, 
differing degrees of mastery, and so on. 
Whatever analogical or other explana- 
tory conceptualization is adopted would, 


4 In this paper we are dealing with the establishment
of a “preference” for one of two
stimulus-objects, or of consistent differential 
movements to differing stimulus-situations, in 
organisms with certain potentialities and as- 
sociations already given innately and/or by 
previous experience. Chief among these pre- 
requisite and taken-for-granted behavioral 
characteristics are (1) differential effects on the 
nervous system of the discriminanda, i.e., dis- 
criminability of the pertinent stimuli; (2) 
“object-” or “thing-perception,” such that a 
given object elicits the same perception under 
varying conditions or contexts (e.g., of dis- 
tance, illumination, background) ; (3) sensory- 
motor coordinations such that a given percep- 
tion does (or through experience may come 
to) evoke a response which is always “the 
same” in reference to (a) the spatial relation- 
ships of organism and object (e.g., the animal 
makes appropriate muscular movements to 
approach food, regardless of whether the food 
is to the left or right, near or far), or (b) 
its own bodily relationships (e.g., if its right 
arm is broken, it now scratches the same itchy 
spot with its left hand). Any attempt to trace 
the origins of such features of behavioral ad- 
justment in our discrimination-experiment sub- 
ject is beyond the scope of the present dis- 
cussion. 










of course, have to be consistent with the 
applicable descriptive categorization. It 
seems probable that, for the higher or- 
ganisms at least, both of the binary 
classifications considered in this paper 
would find application, according to the 
nature of the discrimination situation 
in question. On the explanatory level, 
this might necessitate the postulation of 
two distinct mechanisms for discrimina- 
tion within the same individual. 2) It 
may be that a new binary or multiple 
classification can be found which would 
comprehend all the empirical data here 
grouped under one of two distinct sets 
of descriptive terms. That is, the dis- 
tinction between approach-avoidance 
and discrete, orientational movements, 
may not be “real.” If such a synthesis 
were made on the descriptive level, a 
single inclusive explanatory conceptual- 
ization would account for all the perti- 
nent facts considered in the present dis- 


cussion. 3) Perhaps one of the two 
binary classifications considered in this 
paper can be “reduced” to the other, 
can be shown to be a special case of the 


second one. This solution likewise 
would have the advantage of making 
possible the explanation of all two- 
choice discrimination learning by a 
single mechanism. 

The remainder of this discussion will 
examine the third one of the aforemen- 
tioned alternatives. I shall try to sug- 
gest how the kinds of discrimination 
learning which at first thought do not 
seem to fit the approach-avoidance for- 
mulation, may nevertheless be appro- 
priately described in those terms. The 
first step will be to see if the distinc- 
tion between our two sets of terms can 
be sharpened. 

The term “approach” is, by itself, 
meaningless; it calls for a supplemen- 
tary statement of what is being ap- 
proached. Avoidance, likewise, is avoid- 
ance of something. That something is 
usually an object, a region or a condi- 




tion of the environment, frequently 
called the goal. Approach and avoid- 
ance, therefore, are terms used to de- 
scribe an act, or goal-directed behavior. 

Left-going and right-going, by con- 
trast, are descriptions of movements, 
without essential reference to the en- 
vironment or to the effect of the move- 
ment. This part (of the body) moves, 
or that part; this part flexes or extends; 
the body as a whole turns on its vertical 
axis leftwards or rightwards. 

We must here digress briefly in order 
to avoid a deceptive semantic confusion. 
Any description of response which 
makes essential references to the en- 
vironment (to a goal) is description of 
an act to which the approach-avoidance 
formulation applies. This is true even 
when, as a short-hand convenience, we 
speak of the subject as “going to” or 
“manipulating something” which is des- 
ignated by its spatial characteristics— 
e.g., as being to or of something on 
the left side of a specific environment. 
Consider the type of experiment in 
which an animal is required to re- 
spond to one or the other of two 
places (visually similar boxes, doors, 
buttons) in accordance with stimuli 
having no differential spatial charac- 
teristics. Auditory problems (in which 
the subject must respond to the left for 
one sound, to the right for a different 
sound) and T-maze studies in which 
rats are trained to make one turn when 
hungry, the opposite turn when thirsty, 
are examples. Although the response- 
objects may be visually very similar or 
identical, they are usually differentiable 
exteroceptively by one or both of the 
following types of cue: a) background 
features, including direction and dis- 
tance from conspicuous objects such as 
lights and windows; b) kinaesthetic 
stimuli resulting from orientation in re- 
spect to background features or in re- 
spect to initial posture based on a con- 
stant point of entrance to the experi- 







mental situation. Here the response is 
described with reference to the environ- 
mental situation. If, on the other hand, 
the response is described only as a given 
muscular contraction or movement, with 
no consistent and necessary effect on 
the organism-environment relationship, 
the approach-avoidance formulation is 
no longer directly applicable. 

On first thought, increased respiration 
rate, salivation, or hand-clapping appear 
to be descriptions of response in terms 
of “pure” movement. There is no ob- 
vious goal-object or altered organism- 
environment relation, and most acts are 
describable in those terms. A broader 
and more generally applicable criterion 
of the act, however, is that of a re- 
sponse having a given consequence or 
effect. The salivary response may be 
thought of purely as a secretory activity, 
but it may also be conceived of as the 
production of saliva. Hand-clapping 
is a muscular coordination, but it re- 
sults in cutaneous, kinaesthetic, and 
auditory sensations. This point will be 
discussed further below. 


In the auditory problem mentioned 
above, the a- and b-type cues, differentiating
the response-objects exteroceptively
(i.e., by reference to the environment), 
are not sufficient to solve the problem; it 
is the spatially neutral stimulus of sound 
X (rather than sound Y) which determines 
that response should be to the left rather 
than to the right. One may, therefore, con- 
ceive of the cue as consisting of a complex 
stimulus or stimulus-pattern: 


\[ (S_L, S_X) \rightarrow R+ \]
\[ (S_R, S_X) \rightarrow R- \]

where $S_L$ and $S_R$ designate the exteroceptive
cues differentiating the alternatives,
left and right, and R+ and R− represent
approach and avoidance (or, response and
no response), respectively. Or one may
say that a conditional response is required:
If $S_L$ and $S_X$, then R+, but if $S_X$ and $S_R$,
then R−; if $S_R$ and $S_Y$, then R+, but if
$S_L$ and $S_Y$, then R−. The fact that such
problems are in general learned with rela- 
tive difficulty, is in keeping with this in- 
terpretation. The fact that response is 
precisely to a box (or door, or button) 
and not to the usual or approximate loca- 
tion of the box, even when, as in an ex- 
periment on spatial delayed response with 
chimpanzees, (4) the containers are sur- 
reptitiously moved closer together or fur- 
ther apart, is not a critical argument, but 
it is more directly accounted for in terms 
of approach-avoidance than of specific 
muscular movements which have no reference
to a goal or effect.
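
The conditional-response interpretation can be written out as a small lookup table. The sketch below is illustrative only: the labels `S_L` and `S_R` for the place cues and `S_X` and `S_Y` for the two sounds, like the function name, are chosen here for the example, and the assignment of sound X to the left follows the auditory problem described above.

```python
# Hypothetical sketch of the conditional response in the auditory problem:
# sound X requires response to the left, sound Y response to the right.
APPROACH, AVOID = "R+", "R-"

conditional_response = {
    ("S_L", "S_X"): APPROACH,
    ("S_R", "S_X"): AVOID,
    ("S_R", "S_Y"): APPROACH,
    ("S_L", "S_Y"): AVOID,
}


def choose_place(sound):
    """Return the place cue that is approached for the given sound cue."""
    for (place, cue), response in conditional_response.items():
        if cue == sound and response == APPROACH:
            return place
    return None
```

Neither cue alone determines the response; only the combination does, which is in keeping with the relative difficulty of such problems.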

Up to this point our discussion has been 
confined to the two-choice discrimination 
situation. The same considerations apply 
in principle to the multiple-choice situa- 
tion, where selection of one box (or door 
or path) of three or more, on the basis of 
spatial relationships or visual characteris- 
tics, is required. As a matter of fact, 
choice of the one positive, from among 
eight closely spaced, similar negative al- 
ternatives, would require rather extreme 
precision of the differential movements 
being learned to each of nine stimulus 
concatenations, and presumably would in- 
volve an increase of difficulty which is not 
borne out by experimental data (6). If 
the eight negatives were all different, the 
subject would have to learn nine responses, 
each of which could be elicited by 40,320 
stimulus configurations! If, on the other 
hand, the subject has learned approach to 
the positive, avoidance of the negative, 
increasing the number of negative objects 
should not appreciably affect difficulty of 
the problem. 
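
The figure of 40,320 is the number of orderings of eight distinct objects, that is, 8 factorial. A short check in Python follows; the variable names are chosen here for illustration, and the total over all nine positions of the positive object is an added computation, not a figure given in the text.

```python
from math import factorial

negatives = 8
# Orderings of eight distinct negatives among the eight remaining places,
# i.e. the configurations that share one position of the positive object.
configurations_per_response = factorial(negatives)

# One learned response per possible position of the positive object.
responses = negatives + 1
total_configurations = responses * configurations_per_response
```

On the movement description the subject would thus face 9 × 40,320 = 362,880 distinct total configurations, whereas on the approach-avoidance description only the single positive object need be identified.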

“Successive” discriminations, in which 
only one (of the to-be-discriminated stim- 
uli) appears at a time, offer no additional 
problem in the present context. Usually 
the subject is required to make, or not to 
make, a certain response, and this may be 
described in approach-avoidance terms: 

\[ S_X \rightarrow R+ \quad \text{and} \quad S_Y \rightarrow R- \]



When the response must be differentially 
oriented in accordance with spatially dif- 
ferentiated stimuli (e.g., pressing the left 













key when a light is flashed on the left, 
the right key when it flashes on the right), 
the behavior is appropriately described as 
approach to the lighted (and avoidance of 
the nonlighted) side. And if discrete movements
are required to stimuli which are
not themselves spatially differentiated (very 
few instances of this problem have been 
reported in the literature), the interpreta- 
tion of a compound stimulus or conditional 
response is applicable: a yellow light, for 
instance, in combination with certain back- 
ground and/or kinaesthetic stimuli, re- 
quires approach; a blue light in combina- 
tion with the same exteroceptive stimuli 
demands avoidance. 

There are some instances of discrimina- 
tive behavior which fit the approach-avoid- 
ance formulation (or variations thereof, 
such as response vs. no response, or posi- 
tive vs. negative response) but seemingly 
have no goal-reference; the response does 
not obviously alter the organism-environ- 
ment relation. The salivary response in 
classical conditioning would be an illustra- 
tion of this. There are other instances 
which have no goal-reference and which, 
offhand, seem to be describable only in 
terms of differential movements of various 
muscle groups: Thorndike’s cats and mon- 
keys which were rewarded for assuming 
nonsensical postures (12), Skinner’s super- 
stitious cage-pacing and hopping pigeons 
(11), and Schiller’s head-patting chim- 
panzee (10). There is a certain degree of 
discrimination here; the response, being a 
part of the animal’s repertoire of “sponta- 
neous” behavior, may occur at any time, 
but is given more frequently under certain 
conditions: in the presence of the experi- 
menter, or in a particular cage or appara- 
tus. In order to provide an extreme case, 
we may extrapolate from this sort of be- 
havior to the imagined instance of an ani- 
mal which has learned to twitch its nose for 
one stimulus, its tail for a different stimulus. 

In order to describe these latter discrimi- 
native responses as acts, we must apply 
our more general criterion, namely that of 
a consistent consequence or effect. (Prob- 
ably all the aforementioned responses have 
distinct sensory consequences—even when, 
for instance, the saliva does not reach the 
mouth. It has been reported that skeletal 
conditioning does not occur in the curarized 




animal, 2). Patting the head is comparable 
to pushing the red plaque; after training, 
the ape does not sometimes pat its belly, 
and if one hand is occupied in holding a 
piece of food, the other hand is used to 
produce the same result. The response is 
defined in terms of its effect. Stimuli are 
discriminated, not responses. For our hy- 
pothetical extreme case we must include 
local signs among the stimuli which may 
determine a discriminative response. A 
local sign presumably has an already es- 
tablished connection with motor centers 
controlling movement of the same bodily
region. With $S_N$ and $S_T$ representing local
signs for nose and tail, respectively, the
diagrammatic representation of nose-twitching
for red ($S_R$), tail-twitching for green
($S_G$) becomes

\[ (S_N, S_R) \rightarrow R+ \]
\[ (S_T, S_G) \rightarrow R+ \]



The conditional response is “twitching,” 
and whether the nose or tail twitches de- 
pends on the particular combination of ex- 
ternal and local-sign stimuli. 


CONCLUSIONS 


The logical necessity, in some instances,
of describing the discriminative
response as an act, that is, in terms of 
its consequence or effect (usually with 
reference to a goal or an altered or- 
ganism-environment relationship), is il- 
lustrated in the ape experiment reported 


above. Description in terms of move- 
ment, without reference to the environ- 
ment, does not account for the prompt 
transfer shown by sixteen chimpanzees. 
There is no conceivable “equivalence” 
or “family relationship” in either the 
stimulus-situation—white-black, black- 
white black _ 
black’ white ’ 
movements—left, right and up, down 
—which accounts for the generalization. 

I have tried to show the applicability 
and adequacy of the approach-avoid- 
ance formulation for a wide variety of 


white and or in the 





DESCRIPTION OF LEARNED RESPONSE IN DISCRIMINATION BEHAVIOR 131 


discrimination situations, including suc- 
cessive discrimination and problems re- 
quiring selection from multiple alterna- 
tives. For certain types of problem 
we have had to conceive of the behavior 
as involving a conditional response— 
a response determined by a complex 
stimulus, or by the “simultaneous” ef- 
fectiveness of two independent cues. 
In one extreme hypothetical case local 
signs have had to be included among 
these effective stimuli. The plausi- 
bility of these assumptions is supported 
by the experimental finding that such 
problems are relatively difficult, and are 
learned slowly. 

The approach-avoidance formulation 
more adequately accounts for the pre- 
cision of response, especially when 
positions of the choice-objects are varied 
slightly, than does description in terms 
of learned movements. It makes un- 
necessary a second, supplementary 
mechanism (which presumably would 
have to be an approach-avoidance type 
of mechanism) to account for final cor- 
rection and adjustment after the criti- 
cal learned movement has brought the 
subject (his body or his hand) nearer 
to the correct object. It offers a single, 
uniform, consistent set of descriptive 
terms for discriminative behavior. The
desirability of such uniformity at the
descriptive level, for subsequent
theoretical elaboration, was discussed at
the beginning of this paper.

The integrations which provide (a) 
perceptions—of things, of direction, dis- 
tance, and so on—and (b) the muscular 
and sensory-motor coordinations of lo- 
comotion and manipulation, are either 
innate or have been acquired in earlier 
ontogeny. With the origins of these 
basic organizations we have not here 
been concerned; we have taken these 
units of integration for granted. Since 
such perceptions and sensory-motor co- 
ordinations are already available, it 
would appear a great deal more efficient
and effective for the organism to take
advantage of them: more efficient because
establishment of one or two associations
suffices, more effective because
object-approach has wide generality. A
learned movement, on the other hand, has
by definition no consistency in reference
to a goal or effect.


REFERENCES 


1. Gulliksen, H., & Wolfle, D. L. A
theory of learning and transfer. I.
Psychometrika, 1938, 3, 127-149. II.
Psychometrika, 1938, 3, 225-251.

2. Harlow, H. F. The effects of incomplete
curare paralysis upon formation and
elicitation of conditioned responses in
cats. J. genet. Psychol., 1940, 56, 273-282.

3. Householder, A. S. Neural structure in
perception and response. Psychol.
Rev., 1947, 54, 169-176.

4. Harrison, R., & Nissen, H. W. Spatial
separation in the delayed response
performance of chimpanzees. J. comp.
Psychol., 1941, 31, 427-435.

5. Kendler, H. H., & Gasser, W. P.
Variables in spatial learning. I. Number
of reinforcements during training. J.
comp. physiol. Psychol., 1948, 41, 178-187.

6. McCulloch, T. L., & Nissen, H. W.
Equated and non-equated stimulus
situations in discrimination learning by
chimpanzees. J. comp. Psychol., 1937,
23, 365-376.

7. McCulloch, W. S., & Pitts, W. A logical
calculus of the ideas immanent in
nervous activity. Bull. math.
Biophysics, 1943, 5, 115-133.

8. Nissen, H. W., Carpenter, C. R., &
Cowles, J. T. Stimulus- versus
response-differentiation in delayed
reactions of chimpanzees. J. genet. Psychol.,
1936, 48, 112-136.

9. Northrop, F. S. C. The logic of the
sciences and the humanities. New York:
Macmillan, 1948. Pp. xiv + 402.

10. Schiller, P. H. Personal communication.

11. Skinner, B. F. ‘Superstition’ in the
pigeon. J. exp. Psychol., 1948, 38, 168-172.

12. Thorndike, E. L. Animal intelligence.
New York: Macmillan, 1911. Pp. 297.

13. Tolman, E. C. There is more than one
kind of learning. Psychol. Rev.,
1949, 56, 144-155.

14. Wiener, N. Cybernetics. New York:
John Wiley and Sons, 1948. Pp. 194.


[MS. received September 13, 1949]














THE PSYCHOLOGICAL INDEX


The Psychological Index is a bibliography
of psychological literature which
was published annually for the Ameri- 
can Psychological Association from 
1895 to 1936. Eleven of the forty-two 
volumes are out-of-print: numbers 2, 
3, 4, 5, 9, 12, 18, 20, 22, 25, and 26. 
The remaining 31 issues are priced at 
$2.00 per volume, and sold in sets or 
separately. 


Ten per cent discount is given on 
orders over fifty dollars. 


American Psychological Association 
1515 Massachusetts Avenue N. W. 
Washington 5, D. C. 




















PSYCHOLOGICAL REVIEW 


AVAILABLE NUMBERS 


[Table of available numbers, Volumes 1 through 57: not legible in this copy.]


Total $239.50
Discount

Net price, Volumes 1 through 56 $168.65
Review which are listed in the above table are for sale. The table is 


Information about prices: the Psychological Review has the uniform price of $5.50 per volume and $1.00
incomplete volumes, the price is $1.00 for each available number. For foreign postage, 8.25
American Psychological Association gives the following discounts on orders


10% on orders of $ 50.00 and over
20% on orders of $100.00 and over 
30% on orders of $150.00 and over 


Current subscriptions and orders for back numbers should be addressed to 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
1515 Massachusetts Avenue N.W. Washington 5, D. C. 





